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Preface to the Second Edition 



This is a revised, expanded, and improved version of my textbook, Introduction 
to Queueing Theory. As before, it is written primarily for seniors and graduate 
students in operations research, computer science, and industrial engineering; 
and as before, the emphasis is on insight and understanding rather than either 
cookbook application or fine mathematical detail. Although it has been struc- 
tured for use primarily as a textbook, it should be a useful reference for re- 
searchers and practitioners as well. The second edition reflects the feedback of stu- 
dents and instructors who have used the first edition (which began as a set of 
notes for an in-house course I taught at Bell Laboratories), as well as my own 
experience teaching from the book at the Georgia Institute of Technology (in the 
School of Industrial and Systems Engineering and the School of Information and 
Computer Science), the University of Michigan (in the Department of Industrial 
and Operations Engineering), the New Mexico Institute of Mining and Technol- 
ogy (in the Department of Mathematics), and Florida Atlantic University (in the 
Department of Computer and Information Systems). The objective of the second edi- 
tion is to improve on the first edition with respect to clarity, comprehensiveness, cur- 
rency, and, especially, its utility both for teaching and, self-study. 
In particular, improvements over the first edition include 

1. the availability from the publisher of an instructor's manual containing de- 
tailed solutions, by Berge Tilt, to essentially all of the many exercises; 

2. a new chapter, on simulation of queueing models; 

3. an expanded and more complete treatment of background mathematical 
topics, such as Laplace- Stieltjes transforms and renewal theory; 

4. an annotated bibliography of almost all of the English language books on 
queueing theory in existence; 

5. the placement of exercises in the text just after the discussion they are 
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designed to illuminate, rather than all together at the end of each chapter, 
and the addition of new exercises, including numerical exercises that 
reflect engineering and economic concepts; 

6. the presentation in the text of some of the more difficult and important 
material that was left to the exercises in the first edition; 

7. the integration of the material of Chapter 6 of the first edition ("Waiting 
Times"') into the other chapters; 

8- the inclusion of new material and references published after the first edi- 
tion; 

9. revision throughout to improve clarity and correct errors; 
10. arrangement of the material so that the book can be used either for a 
course that emphasizes practical applications, or for a course that em- 
phasizes more advanced mathematical concepts with an eye toward more 
sophisticated applications or research. 

The first chapter illustrates the nature of the subject, its potential for applica- 
tion, and the interplay between intuitive and mathematical reasoning that makes 
it so interesting. Chapter 2 contains a review of topics from applied probability 
and stochastic processes. Since the reader is presumed to have had a course in 
applied probability, some of the material (such as the discussion of common 
probability distributions) may well be familiar, but other topics (such as elemen- 
tary renewal theory) may be less familiar. The instructor can decide which topics 
to emphasize and which to assign for review or self-study. Chapter 3 covers the 
basic one-dimensional birth- and-death queueing models, including the well- 
known Erlang B, Erlang C, and finite-source models. Care is taken to distin- 
guish between the viewpoint of the arriving customer and that of the outside 
observer. Chapter 4 discusses multidimensional birth-and-death models. This 
chapter is organized by model, each of which exemplifies a different method of 
solution of the multidimensional birth-and-death equations, The methods include 
product solutions, probability-generating functions, inspection, the method of 
phases, and numerical analysis. The models include networks of queues (with 
many examples and references), systems with alternate routing, and systems with 
multiple types or classes of traffic. Little's theorem, Kendall's notation, and the 
Pollaczek-Khintchine formula are mentioned, but detailed discussion and deriva- 
tions are left to Chapter 5; the instructor can refer ahead or not as he sees fit. 
Relatively "advanced 1 ' mathematical concepts, such as Laplace-Stieltjes trans- 
forms and imbedded Markov chains, are not used until Chapter 5. The intention 
is that an elementary course, requiring no knowledge of * 'advanced" applied 
mathematics but nevertheless conveying the main results and the essence of the 
subject, can be constructed from topics covered in the first four chapters. For the 
very practical-minded, a course can be constructed using only Chapters 1 and 3. 

Chapter 5, which is the longest chapter, uses more advanced mathematics and 
is more rigorous in its arguments than the preceding chapters. It covers much of 
the material of classical queueing theory, as well as some more specialized top- 
ics, such as queues with service in random order, which can be skipped without 
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loss of continuity. Also, it includes a summary treatment of Riemann-Stieltjes 
integration and Laplace-Stieltjes transforms, tools that are often used but rarely 
explained in textbooks on queueing theory or applied stochastic processes. An 
intermediate-advanced course for students with prior knowledge of elementary 
queueing theory can be built around Chapter 5. Chapter 6, which is completely 
new, surveys the field of simulation as applied to queueing models. It contains, 
among other things, one of the first textbook treatments of the regenerative 
method in the context of simulation of queueing models, and discussions of two 
structurally simple but nevertheless sophisticated simulation studies. Chapter 7 
concludes with an annotated bibliography of (almost) all the books on queueing 
theory and its application that have been published in the English language. (The 
reader may well turn to Chapter 7 first.) 

Colleagues who have contributed to this book, either directly by their reading 
and commenting on the manuscript or indirectly by their personal and profes- 
sional influence, include Carl Axness, Paul J. Burke, Grace M. Carter, Ralph 
L. Disney, Hamilton Emmons, Carl M. Harris, Philip Heidelberger, Daniel P. 
Hey man, Marcel F. Neuts, Stephen St. John, Donald G. Sanford, Bruce W. 
Schmeiser, Samuel S. Stephenson, Shaler Stidham, Jr., Ryszard Syski, and 
Eric Wolman. Special thanks go to B0rge Tilt who read the manuscript in its 
entirety (through several versions) > made many excellent suggestions, and wrote 
the solutions manual (whose existence is surely one of the major attractions of 
this book). Finally, I want to thank Elizabeth Fraissinet and Dora Yates, who typed the 
manuscript, and Louise Calabro Gruendel of Elsevier Science Publishing Company, 
who oversaw the editing and production. 



Robert B. Cooper 



[1] 

Scope and Nature 
of Queueing Theory 



This text is concerned with the mathematical analysis of systems subject to 
demands whose occurrences and lengths can, in general, be specified only 
probabilistically. 

For example, consider a telephone system, whose function is to provide 
communication paths between pairs of telephone sets (customers) on 
demand. The provision of a permanent communications path between 
each pair of telephone sets would be astronomically expensive, to say the 
least, and perhaps impossible* In response to this problem, the facilities 
needed to establish and maintain a talking path between a pair of tele- 
phone sets are provided in a common pool, to be used by a call when 
required and returned to the pool when no longer needed. This introduces 
the possibility that the system will be unable to set up a call on demand 
because of a lack of available equipment at that time. Thus the question 
immediately arises: How much equipment must be provided so that the 
proportion of calls experiencing delays will be below a specified acceptable 
level? 

Questions similar to that just posed arise in the design of many systems 
quite different in detail from a telephone system. How many taxicabs 
should be on the streets of New York City? How many beds should a 
hospital provide? How many teletypewriter stations can a time-shared 
computer serve? 

The answers to these questions will be based in part on such diverse 
considerations as politics, economics, and technical knowledge. But they 
share a common characteristic: In each case the times at which requests 
for service will occur and the lengths of times that these requests will 
occupy facilities cannot be predicted except in a statistical sense. 



2 



Scope and Nature of Queueing Theory 



The purpose of this text is to develop and explicate a mathematical 
theory that has application to the problems of design and analysis of such 
systems. Although these systems are usually very complex, it is often 
possible to abstract from the system description a mathematical model 
whose analysis yields useful information. 

This mathematical theory is a branch of applied probability theory and 
is known variously under the names traffic theory, queueing theory, 
congestion theory, the theory of mass service, and the theory of stochastic 
service systems. The term traffic theory is often applied to theories of 
telephone and communications traffic, as well as to theories of vehicular 
traffic flow. These two areas share some common ground, and the material 
we shall develop will be useful in both fields. The term queueing theory is 
often used to describe the more specialized mathematical theory of waiting 
lines (queues). But some of the most interesting and most useful models are 
based on systems in which queues are not allowed to form, so that the term 
queueing theory does not seem completely appropriate. The subject of this 
text is perhaps better described by the broader terms congestion theory 
and stochastic service system theory. However, the name queueing theory 
has become most widely used to describe the kind of material presented in 
this text, and therefore I chose to entitle the book Introduction to Queueing 
Theory. 

Historically, the subject of queueing theory has been developed largely 
in the context of telephone traffic engineering. Some of our examples will 
be drawn from this area, but it should be kept in mind that the theory is 
widely applicable in engineering, operations research, and computer sci- 
ence. Also, it is worth noting that the mathematics underlying queueing 
theory is quite similar to that underlying such seemingly unrelated subjects 
as inventories, dams, and insurance. 

Consider the following model. Customers request the use of a particular 
type of equipment (server). If a server is available, the arriving customer 
will seize and hold it for some length of time, after which the server will be 
made immediately available to other incoming or waiting customers. If the 
incoming customer finds no available server, he then takes some specified 
action such as waiting or going away. Such models often can be defined in 
terms of three characteristics: the input process, the service mechanism, 
and the queue discipline. 

The input process describes the sequence of requests for service. Often, 
for example, the input process is specified in terms of the distribution of 
the lengths of time between consecutive customer arrival instants. The 
service mechanism includes such characteristics as the number of servers 
and the lengths of time that the customers hold the servers. For example, 
customers might be processed by a single server, each customer holding the 
server for the same length of time. The queue discipline specifies the 
disposition of blocked customers (customers who find all servers busy). 
For example, it might be assumed that blocked customers leave the system 
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immediately or that blocked customers wait for service in a queue and are 
served from the queue in their arrival order. 

We now use a model of this type to illustrate some important points 
about queueing theory: (1) The subject matter is of great practical value. 
(2) Heuristic and intuitive reasoning is often useful. (3) Mathematical 
subtleties abound. 

Consider two cities interconnected by a group of s telephone trunks 
(servers). Suppose that arrivals finding all trunks busy do not wait, but 
immediately depart from the system, (Technically, no "queueing" occurs.) 
What proportion of incoming calls (customers) will be unable to find an 
idle trunk (and thus be lost)? 

We wish to derive a formula that will predict the proportion of calls lost 
as a function of the demand; that is, we wish to derive a formula that 
allows estimation of the number of trunks required to meet a prespecified 
service criterion from an estimate of the telephone traffic load generated 
between the two cities. The great practical value of any model that leads to 
such a formula is obvious. The fact that such models have been success- 
fully employed has spurred continuing investigations by industrial and 
academic researchers throughout the world* 

We shall now give a heuristic derivation of the required formula, using a 
concept of great importance in science and engineering, that of conserva- 
tion of flow. It is important for the reader to realize that since the following 
derivation is heuristic, he should not expect to understand it completely; 
our "derivation" is really a plausibility argument that, as will be demon- 
strated later, is correct in certain circumstances. With this disclaimer, let us 
now proceed with the argument. 

When the number of customers in the system is j t the system is said to 
be in state E } (y = 0, l,,.,,j). Let Pj be the proportion of time that the 
system spends in state Ey, Pj is therefore also the proportion of time that j 
trunks are busy. Denote by X the call arrival rate; X is the average number 
of requests for service per unit time. Consider first the case j <s. Since calls 
arrive with overall rate \, and since the proportion of time the system 
tpends in state Ej is P p the rate at which the transition E j ^E JJl _ x occurs 
(the average number of such transitions per unit time) is therefore XPj. 
Now consider the case when J = s. Since the state E s+l represents a 
physically impossible state (there are only s trunks), the transition E s -* 
E l+X cannot occur, so the rate of transition E s -+E s+l is zero. Thus the rate 
at which the upward transition Ej~*E J+l occurs is when j = 0, 1, . . . , j - 1 
is zero when j — s. 

Let us now consider the downward transition 

E J+l -+Ej = 0,1,. 

suppose that the mean holding time (the average length of time a call 
holds a trunk) is t. Then, if a single trunk is busy, the average number of 
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calls terminating during an elapsed time t is 1 ; the termination rate for a 
single call is therefore 1 /t. Similarly, if two calls are in progress simulta- 
neously and the average duration of a call is t, the average number of calls 
terminating during an elapsed time r is 2; the termination rate for two 
simultaneous calls is therefore 2/r. By this reasoning, then, the termina- 
tion rate for y'+l simultaneous calls is C/ + 1)/t. Since the system is in 
state E J+l a proportion of time Pj+i* we conclude that the downward 
transition E J+l ^>Ej occurs at rate U+1)t~ 1 P j+1 transitions per unit time 
0=0,1, ...,*-l). 

We now apply the principle of conservation of flow: We equate, for each 
value of the index j, the rate of occurrence of the upward transition 
Ej^E J+l to the rate of occurrence of the downward transition E J+i -+Ej. 
Thus we have the equations of statistical equilibrium or conservation of 
flow: 

\Pj = U+l>- l P J+ i C/=0,1,...^~1). (1-1) 

These equations can be solved recurrently; the result, which expresses each 
Pj in terms of the value P Qy is 

Pj-^-Po 0=1,2,...,*). (1.2) 

Since the numbers {Pj} are proportions, they must sum to unity: 

J» +^,+ -" + (1.3) 



Using the normalization equation (1.3) together with (1.2), we can de- 
termine P : 

Thus we obtain for the proportion P- of time that j trunks are busy the 
formula 

Pj- } XtY/J[ = 0,1,...,*). (1.5) 

2 (Xt)*/*i 

An important observation to be made from the formula (1,5) is that the 
proportions {Pj} depend on the arrival rate A and mean holding time r 
only through the product At. This product is a measure of the demand 
made on the system; it is often called the offered load and given the symbol 
a (a = At). The numerical values of a are expressed in units called erlangs 
(erl), after the Danish mathematician and teletraffic theorist A. K. Erlang, 
who first published the formula (1,5) in 1917. 
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Wheny = ,y in (1.5), the right-hand side becomes the well-known Erlang 
loss formula, denoted in the United States by B(s,a) and in Europe by 

B(s,a)= f /sl . (1.6) 

k = 

We shall derive these results more carefully later in the text. The point to 
be made here is that some potentially useful mathematical results have 
been derived using only heuristic reasoning. The question we must now 
answer is: What, if any, are the conditions under which these results are 
valid? 

More precisely, what assumptions about the input process and service 
mechanism are required for the validity of the formulas (1.5) and (1.6)? 
[We have described the input process by giving only the arrival rate; 
similarly, the service times have been specified only through their mean 
value. It turns out that our conclusion (1.5) is valid for a particular type of 
input process called a Poisson process and, surprisingly, for any distribution 
of service times whatever.] Can the assertion that the downward transition 
rate is proportional to the reciprocal of the mean holding time be justified? 
What is the relationship between the proportion P. of time that j calls are 
in progress and the proportion TI p say, of arriving calls that find j other 
calls in progress? [It turns out that for systems with Poisson input these 
two distributions are equal. Thus, the Erlang loss formula (1.6) gives the 
proportion of customers who will be denied service, that is, lost, when the 
arrivals occur according to a Poisson process.] How widely applicable is 
the conservation-of-flow analysis? How does one handle processes for 
which this type of analysis is inapplicable? 

Questions of this nature sometimes require highly sophisticated mathe- 
matical arguments. In this text we shall take a middle ground with regard 
to the use of advanced mathematics. We shall attempt to impart an 
understanding of the theory without recourse to the use of abstract and 
measure-theoretic tools; the material is presented in a mathematically 
informal manner with emphasis on the underlying physical ideas. Mathe- 
matical difficulties and fine points will be noted but not dwelled upon. The 
material should be accessible to a student who understands applied proba- 
bility theory and those areas of mathematics traditionally included in 
undergraduate programs in engineering and the physical sciences. 

A word about the range of applicability of these models is in order here. 
The example discussed above is based on a telephone traffic application, 
but it should be clear that the identification of the customers as "calls" and 
the servers as "trunks" in no way limits the generality of the model. 

As mentioned previously, telephone applications have provided the 
context for the development of much of queueing theory (and this example 
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illustrates why). We shall occasionally refer to telephone applications, 
thereby illustrating the general principles of the theory with a coherent 
class of authentic examples. Of course, no technical knowledge of tele- 
phony is required. 

In summary, this is a text and reference in queueing theory, a subject of 
both practical importance and theoretical interest. Although the book is 
directed primarily toward the student, some of the lesser-known concepts 
and results from telephone traffic theory should prove of interest, apart 
from their value in illustrating the theory, to engineers and researchers. 
The material is presented with an emphasis on the underlying physical 
processes and the interplay between physical and mathematical reasoning. 
Heuristic and intuitive approaches are used where helpful. Mathematical 
subtleties are observed and are explored where it is profitable to do so 
without recourse to methods of abstract analysis, 

Since the publication of Erlang's first paper in 1909 (which is regarded 
by many as marking the birth of queueing theory), and especially since the 
renewed interest in queueing theory that accompanied the formalization of 
the field of operations research after World War II, about 2000 papers and 
at least 35 books on this subject have been published. It is hoped that the 
reader will find the present book a useful and interesting addition to the 
literature. 

Exercises 

1. In what ways are telephone traffic theory and vehicular traffic theory similar, 
and in what ways are they different? 

2. List some applications of the Erlang loss model, 

3. Discuss ways in which queueing models might be used in the following: 

a. Highway design. 

b. City planning. 

c. Hospital management. 

d. Airport design. 

e. Reliability engineering, 

f. Computer design. 

4. Extend the heuristic conservation-of-flow argument to include the case in 
which all customers who find all servers busy wait until served. 

a. Argue that 




(y=U,. ..,*-!), 



(1) 



Jcope and Nature of Queueing Theory 



7 



where 



b. What restriction must be placed on the magnitude a of the offered load in 
order for (2) to be correct? Give a physical interpretation of this restriction. 

c Show that the proportion of time that all servers are busy (which, in this 
case, turns out to equal the proportion of customers who find all servers 
busy and therefore must wait for service) is given by 



s\(\~a/s) 

V — + 

£ k\ sl(l-a/s) 

This is the well-known Erlang delay formula, also denoted by E^a), which 
we shall discuss in detail in Chapter 3. 

d. What assumption, if any, have you made about the order in which waiting 
customers are selected from the queue when a server becomes idle? Is such 
an assumption necessary? Why? 

e. Show that, if all servers are busy, the probability p } that j customers are 
waiting for service is given by 

^. = (l-pV = 0,1,. „), (4) 

where p = a/s, IThe probabilities defined by (4) constitute the geometric 
distribution.] 

U Suppose it is known that at least k>0 customers are waiting in the queue. 
Show that the probability that the number of waiting customers is exactly 
j + k is p p given by (4). Note that this probability is independent of the value 
of the index k. 

g. Show that Pj=Pj 0=0,1,...) when s= 1. Show that C(l r a) = a. 

Consider the so-called loss-delay system* which is one with a finite number n of 
waiting positions: An arrival who finds all servers busy and at least one waiting 
position unoccupied waits as long as necessary for service, while an arrival who 
finds all n waiting positions occupied departs immediately (and thus is lost), 
a. Using the conservation-of-flow argument, show that the probability Pj thaty 
customers are present simultaneously is given by the formula (1) of Exercise 
4, where now the largest value of the index j is s + n, and 

/>„= 2^ + 7r2 f • 



b. Use these results to obtain the Erlang loss formula (1.6) and the Erlang 
delay formula (3) of Exercise 4, 
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c. What restrictions, if any, must be placed on the magnitude a of the offered 
load when n < oo? When n — co? 

6. Consider a queueing model with two servers and one waiting position, and 
assume that if an arriving customer finds both servers busy and the waiting 
position unoccupied, then with probability p the customer will wait as long as 
necessary for service, and with probability I —p he will depart immediately. 
(As usual, all customers who arrive when there is an idle server enter service 
immediately, and all customers who arrive when the waiting position is 
occupied depart immediately.) 

a. Let Pj (j—Q t \,2 y 3) be the probability that j customers are present simulta- 
neously; let A be the arrival rate and r be the average service time; and write 
the conservation-of-flow equations that determine the probabilities Pj (J — 
0,1,2,3). 

b. Solve the equations of part a, 

c Find the fraction B of arriving customers who don't get served. 

d. Suppose A — 2 customers per hour, t= 1 hour, and p = \ * Evaluate P& P lf P 2 , 
P 3 , and B. 

e. Suppose the manager receives $2,00 for every customer who gets served 
without waiting and $1.00 for every customer who gets served after having 
to wait. How many dollars per hour will she receive? 

t If the manager pays a rental fee of $.50 per hour per server (whether the 
server is busy or idle), and each server when busy consumes fuel at the rate 
of $.25 per hour, what is the manager's total operating cost in dollars per 
hour? What is her profit in dollars per hour? 



[2] 

Review of Topics 

From the Theory of Probability 

and Stochastic Processes 



This chapter reviews and summarizes some aspects of the theory of 
probability and stochastic processes that have direct application to elemen- 
tary queueing theory. The material given here is intended not only as a 
reference and refresher in probability and stochastic processes but also as 
an introduction to queueing theory. It is assumed that the reader is already 
familiar with the basic concepts of probability theory as covered, for 
example, in ginlar [1975], Feller [1968], Fisz [1963], Neuts [1973], and Ross 
[1972]. Specifically, we assume of the reader working knowledge of the 
basic notions of event, probability, statistical dependence and indepen- 
dence, distribution and density function, conditional probability, and 
moment. These concepts will be used freely where required. Familiarity 
with the theory of Markov chains is helpful, but not essential. Birth-and- 
death processes, generating functions, and the properties of some im- 
portant distributions will be reviewed in some detail. Examples will be 
drawn largely from queueing theory. The reader who feels no need for 
review is advised to skim, but not skip, this chapter before proceeding to 
the next. 



2.1. Random Variables 

If an experiment could be performed repeatedly under identical condi- 
tions, then, experience seems to tell us, each of the different possible 
outcomes of the experiment will occur with a long-run relative frequency 
that remains constant. Probability theory represents an attempt to model 
this "observed" behavior of long-run frequencies with a formal mathemati- 
cal structure. As a mathematical theory, of course, probability theory need 
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only be internally consistent. As a model of reality, however, the theory 
needs also to be consistent with the observed properties of the phenomena 
it purports to describe. To be a useful model, moreover, the theory should 
be capable of exposing facts about reality that were previously hidden 
from view. 

Probability theory is now a well-developed mathematical theory, and has 
proved also to be an excellent model of frequencies in repeated experi- 
ments. We will take advantage of this fact and apply probability theory to 
the analysis of the phenomenon of queueing. Characteristic of applied 
probability, and queueing theory in particular, is the interaction between 
the mathematical and intuitive (that is, interpretive) aspects. All mathe- 
matical results should be understandable in physical terms; and all intui- 
tive results should be capable of rigorous mathematical proof. Unfor- 
tunately, neither of these conditions can always be met. In this text 
intuitive arguments and interpretations will be emphasized, with the tacit 
understanding that, of course, one is never safe until a rigorous argument 
is proffered. 

We begin with a brief discussion of random variables in the context of 
queueing theory, and we follow with an example that illustrates the 
intuitive "frequency" approach to the calculation of probabilities. 

The mathematical definition of event involves the notions of sample 
space and Borel field, but for most practical purposes, intuitive notions of 
event are sufficient. Examples of the kinds of events of interest in queueing 
theory are {an arbitrary customer finds all servers busy}, {an arbitrary 
customer must wait more than three seconds for service), and {the number 
of waiting customers at an arbitrary instant is j). 

In each of these examples, the event can be expressed in numerical terms 
in the following ways. Define N to be the number of customers in an 
j-server system found by an arbitrary arriving customer. Then the event 
{an arbitrary customer finds all s servers busy} can be written {N>s}, 
and the probability of occurrence of this event can be represented by 
P{N>s}. 

Similarly, let W be the waiting time of an arbitrary customer, and let Q 
be the number of waiting customers (queue length) at an arbitrary instant. 
Then these events are { W>3] and {Q=J}, with probabilities P{W>3) 
and P{Q=j], respectively. 

The quantities N, W, and Q defined above are random variables. Each 
of them has the property that it represents an event in terms of a numerical 
value about which a probability statement can be made. More formally: A 
random variable is a function defined on a given sample space, and about 
whose values a probability statement can be made. 

The question of precisely which functions can qualify as random vari- 
ables is, like that of the notion of event, a mathematical one. For our 
purposes, it will be sufficient to consider the use of random-variable 
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notation as simply a device for describing events without explicit concern 
for the underlying probability space. 

A random variable is defined by assigning a numerical value to each 
event. For example, in a coin-tossing experiment, the event {head} can be 
assigned the value 1, and the event {tail} the value 0. Call the describing 
random variable X. Then at each toss a head will occur with probability 
P{X=l) and a tail with probability P{X=0}. This use of the random- 
variable notation may seem artificial, since the association of the event 
{head} with the value X = 1 is arbitrary, and the value X=2.6 might just as 
well have been chosen. But it will be seen that judicious choices of the 
random-variable definitions often lead to simplifications. For example, 
observe that any random variable X that takes only the values and 1 has 
expected value E(X) = 0P{X = 0) + \P{X= 1} = P{X= 1}; that is, the 
mean value of a zero-one random variable is equal to the probability that 
the random variable takes on the value 1. 

Suppose, for example, that we wish to study the random variable S„, 
which we define to be the number of heads occurring in n tosses. Let the 
random variable Xj describe the y'th toss; let Xj , = 1 if the yth toss results in a 
head, and Xj=0 for a tail. Then the random variable S„ can be interpreted 

as the sum S n s =X l -i + X n , The interpretation of S„ as a sum of random 

variables allows its study in terms of the simpler component random 
variables X v ...,X n , without further regard to the physical meaning of S n . 
We shall return to this example later. 

In most of the queueing-theory problems we shall encounter, however, 
the assignment of random-variable values to events is quite natural from 
the context of the problem. Only a malcontent would choose to assign the 
value Q = 42 to the event {six customers are waiting}. 

We now consider an example that will illustrate the intuitive 
"frequency" approach to the calculation of probabilities, while simulta- 
neously providing us with a result that will prove useful in queueing 
theory. Consider a population of urns, where an urn of type j (J~ 1,2,...) 
contains n, balls. Let the random variable X be the type of an urn selected 
at random; that is, P{X=j) is the probability that an arbitrary urn is of 
type J- We interpret P{X=j) as the frequency with which urns of type j 
occur in the general population of urns. Let us say that a ball is of type j if 
it is contained in an urn of type j, and let Y be the type of a ball selected at 
random; we interpret P{Y=J] as the frequency with which balls of type j 
occur in the general population of balls. Our objective is to derive a 
formula that relates the frequencies (probabilities) of occurrence of the 
urns to the corresponding frequencies (probabilities) of occurrence of the 
balls. 

Suppose first that m urns are selected "at random" from the (infinite) 
parent population, and let m y be the number of urns of type j among the m 
urns selected. Then the relative frequency of occurrence of urns of type j 
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among the m urns selected is m,/m. We now assume that this ratio has a 
limit as m tends to infinity, and we call this hypothesized limiting value the 
probability that an "arbitrary" urn is of type j: 

m, 

P{X=j}= lim -± 0=1,2,...). (1.1) 

Similarly, the number of balls that are of type j among all the balls in the 
m urns selected is w,/n y ; hence 

n,m t 

/>{y=;} = jim -^L- = 1,2,...). (1.2) 



2 n,m t 



If we divide the numerator and denominator of (1.2) by m, and use (1.1), 
we obtain 

n,P{X=j) 

P[Y =j} = -^ iL, 0=1,2,...). (1.3) 

2V{*=/} 



Equation (1.3) relates the (mathematical) probability distributions of X 
and Y to each other, without further regard for the (physical) frequencies 
these probabilities are supposed to represent. It is not our purpose here to 
discuss the mathematical theory of probability and its use as a model of 
frequencies of events in infinitely repeatable experiments, but merely to 
point out that our strategy as applied probabilists is both to use mathemat- 
ics to provide physical insight (that is, intuition) and to use intuition to 
guide and check the corresponding mathematical formalism. 

Equation (1.3) can be interpreted as saying that the frequency with 
which balls of type j occur is proportional to the product of the number of 
balls of type j per urn of type j and the frequency of occurrence of urns of 
typey; that is, P {Y=j} = crijP {X=j}, where c = (2 i n i P{X=i})- x is the 
constant of proportionality such that P{Y=l} + P{Y=2}-\ = 1. 

It is easy to verify that (1.3) yields the very obvious results that (1) if 
there are only a finite number r of urn types = l,2,...,r) and each type 
occurs with equal frequency, then 

and (2) if each type of urn contains the same number of balls, then 



P{Y=j} = P{X=j] 0=1,2,...). 
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As an example of the application of (1.3) in queueing theory, we 
consider a single-server system. We view time as being decomposed into a 
sequence of cycles, where each cycle consists of an idle period (during 
which there are no customers present and the server is idle) and an 
adjacent busy period (during which the server is continuously busy). In 
many important queueing models the cycles are independent and statisti- 
cally identical; that is, each cycle represents an independent trial in a 
sequence of infinitely repeatable experiments. Now suppose that the num- 
ber of customers who arrive during a busy period is N, with mean value 
E(N). [The calculation of E(N) depends on other assumptions of the 
model, such as whether customers who arrive during a busy period, and 
thus are blocked, wait for service or leave immediately or take some other 
action.] We shall show that the proportion II of customers who are 
blocked is given by 

According to (1.4) the probability of blocking equals the ratio of the 
mean number of customers who are blocked per cycle, E(N), to the mean 
number of customers who arrive during a cycle, 1 + E(N). This result has 
great intuitive appeal, and will also be useful when E(N) is easy to 
calculate. 

To prove (1.4), let X be the number of customers (balls) who arrive 
during an arbitrary cycle (urn), and let Y be the number of customers who 
arrive during the cycle that contains an arbitrary customer (called the test 
customer) whose viewpoint we shall adopt. Also, let a customer be of type j 
if exactly j customers (including himself) arrive during his cycle. Then 
Equation (1.3) applies, with rij—j: 

Now, if the test customer is one of j arrivals during a cycle, the probability 
that he is not the first of those j, and therefore is blocked, is It 
follows from the theorem of total probability* that the probability II that 
the test customer is blocked is 

n=f, J ^p{Y=j}. (i.6) 

7 = 1 J 



•If the events B X ,B 2 , ... are mutually exclusive (that is, /f,i?,=<f> for all i¥=j) and exhaustive 
(that is, B l uB 2 U-- - *=Q, where S2 is the sample space), then, for any event A,P{A} = 
P{A\B i }P{B 1 } + P{A\B 2 )P{B 2 ) + • • • . (See, for example, p. 15 of ^inlar [1975].) 
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Using (1.5) in (1.6), we get 



11= 



E(X) 



(1.7) 



But the number of arrivals during a cycle equals the number who arrive 
during the busy period plus the one who arrives at the end of the idle 
period and thus initiates the busy period; that is, X= N+ 1. Thus, E(X) = 
E(N)+\, and Equation (1.4) follows. 



1. In the model considered above, suppose that it costs c dollars to turn on the 
server at the start of each busy period. Show that 



that is, if the cost of a cycle is divided evenly among the customers who arrive 
during a cycle, then the average cost per arrival equals the cost per cycle 
divided by the average number of arrivals per cycle. 

2.2. Birth-and-Death Processes 

The theory of birth-and-death processes, developed largely by Feller, 
comprises part of the subject matter commonly called stoc tic processes. 
We now sketch an outline of the theory of birth-and-death ^ocesses. For a 
more comprehensive treatment see Chapter XVII of Feller [1968] and 
Chapter 1.4 of Cohen [1969]. 

Consider a system that for each fixed t (0 < t < oo) can be described by a 
random variable N(t) with realizations 0,1,2,.... Examples are (1) a 
telephone switchboard, where N(t) is the number of calls occurring in an 
interval of length t; (2) a queue, where N(t) is the number of customers 
waiting or in service at time t; (3) an epidemic, where the number of 
deaths that have occurred in (0, t) is N(t); and (4) a city whose population 
is N(t) at time /. 

We wish to study processes that can be described by such a random 
variable N(t). We shall make some assumptions about the behavior of the 
process that are simple enough to avoid intractable analysis but that 
nevertheless lead to useful models. [Note that N(t) is a (different) random 
variable for each fixed t>0. Our interest is the (uncountable) family of 
random variables {N(t), />0}, which is called a stochastic process. Ques- 
tions concerning the existence and properties of such uncountable 
sequences are addressed in books on the theory of stochastic processes. 
The interested reader is referred to the books referenced in the last section 
of this chapter.] 



Exercise 
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We say that a system is in state E- at time t if N(i) = j. Then a process 
obeying the following postulates is called a birth-and-death process: 

If at any time t the system is in state E Jt the conditional probability that 
during (t,t + h) the transition Ej-*E J+1 (y' = 0, 1,...) occurs equals Xjh + 
o(h) as h^O, and the conditional probability of the transition E J -*E J _ 1 
(j=l,2,...) equals fyh + o(h) as h-+0. The probability that during (/,/+ h) 
the index j changes by more than one unit is o(h) as /i— »0. [A quantity f(h) 
is said to equal o(h) as h^O if \im h _+ f(h)/h- t Q. Note that if f(h) — o{h) as 
/i^O, then also xf(h) = o(fi) as h— >0, for any finite x.] 

Applying the theorem of total probability, we can write 

P{N(t+h)-j}- f P{N(t + h)=j\N(t)=i}P{N(t) = i). (2.1) 

(=0 



Now it follows from the postulates that, as h— »0, 

P{N(t+h)=j\N(t) = i} = 
Since we require that 



Xj_ x h + e>(/i) when 
H J+l h + o(h) when / =j + 1 
when |i— j\>2. 



2 p{^(/+/ I )=^|^(/)=y} = i, 

A: = 

it follows that, as /i— »0, 

P{N(t + h) =j\N(t) =j} = \- (A, + tfh + o(h). 

Hence, if we set P{jV(f)=y} — Pj(t), then Equation (2.1) can be written 

P J (t + h)=X J _ 1 hPj_ l (t) + fi J+l hP J+l (t) 
+ [l-(\ J + l i f )h]P J (t) + o(h) 
[h->0; 7 = 0,1,...; A_ 1 = J u = j p_ 1 (f) = 0]. 

Rearranging and dividing through by h, we have 

PXt + hj-PAt) 

— = \j. + n J+ x p J+x {t) 
-h +l $P#) + 4& 

[ h ^0; y = 0,l,...; A_, = m o = P_ 1 (0 = 0]. (2.2) 
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We now let /i— >0 in Equation (2.2). The result is the following set of 
differential-difference equations for the birth-and-death process: 

f t Pj{t) = Xj_ x Pj_ x (t) + #v+i^f+i(0 " + P,)Pj(0 

[j = 0,l,...; X_, = ^=P_ l (t) = 0]. (2.3) 

If at time / = the system is in state E p the initial conditions that 
complement Equation (2.3) are 

w-[o '« 7Ji (2 - 4) 

The coefficients {A y } and { ju,} are called the birth and death rates, 
respectively. When /i y = for all j, the process is called a pure birth process; 
and when A, = for all j, the process is called a pure death process. 

In the case of either a pure birth process or a pure death process, it is 
easy to see that the differential-difference equations (2.3) can always be 
solved, at least in principle, by recurrence (successive substitution). 

For example, consider the important special case of the pure birth 
process with constant birth rate A, = A. (This model is often used to 
describe the arrival process of customers at a queue.) If we assume that the 
system is initially in state E Q , then for this case Equations (2.3) and (2.4) 
become 

j i P J (t) = XP J _ l (t)-XP J (t) [7 = 0,1,...; P.^O] (2.5) 

and 

jW \0 if j¥=0. v ' 

An easy solution by recurrence (or induction) gives, for each t>0, 

Pj(t)=Qjfe-*< 0=0,1,...). (2.7) 

Note that the probabilities given by (2.7) satisfy the normalization condi- 
tion 

(*>0). (2.8) 
(The normalization condition (2.8) is not satisfied for every choice of the 
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birth coefficients. For a more extensive treatment, see Chapter XVII of 
Feller [1968].) 

According to Equation (2.7), N(t) has the Poisson distribution with mean 
\t; we say that {N(t), t>0] is a Poisson process. Since the assumption 
Xj=\ is often a realistic one in the construction of queueing models, the 
simple formula (2.7) is important in queueing theory. The Poisson distribu- 
tion possesses important theoretical properties and plays a central role in 
queueing theory. 

Another important example is the special case of the pure death process 
with death rate proportional to the index of the state Ey, that is, fy=jp. 
(A real system that might fit this model is a population in which only 
deaths occur and where the death rate is proportional to the population 
size.) If we assume that the system is in state E n at / = 0, then Equations 
(2.3) and (2.4) become 

j- t PM=- ni iP n (t) [P H (0)-1]> (2-9) 

[P y (0)=0; y = n-l,«-2,...,2,l,0]. (2.10) 

Solving these equations by recurrence, it is easy to verify that the general 
form of the solution Pj(t) is 

PAt) = (jYe-^y(l-e-n n - J (y=0,l,..., W ). (2.11) 

We remark that the probabilities defined by (2.11) constitute a binomial 
distribution, and therefore sum to unity. 

Exercise 

2. Consider a population modeled as a pure birth process in which the birth rates 
are proportional to the population size, that is, A, — y'X (/ — 0> !»•••)• Write the 
differential-difference equations that determine the probability Pj(t) that N(t) 
=j for j= 1,2,... and all t > 0; and show that if N(0)= 1, then 

PjO^e-^O-e-^y- 1 = 1,2,...). 

We have observed that in the case of the pure birth process ( ju, = 0), the 
differential-difference equations (2.3) can always be solved recurrently, at 
least in principle. Therefore, even though the number of equations is, in 
general, infinite, there is no question about the existence of a solution, 
since the solution can be exhibited. On the other hand, it is not necessarily 
true that the solution {Pj(t)} is a proper probability distribution. In any 
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particular case, a good strategy may be to find the solution first, and then 
determine if the solution is a proper probability distribution, rather than 
vice versa. 

In the case of the pure death process (A, = 0), the differential-difference 
equations (2.3) offer less theoretical difficulty, since not only can they be 
solved recurrently, but also they are finite in number. 

In contrast, in the general case, the equations (2.3) of the birth-and- 
death process do not yield to solution by recurrence, as one can easily 
verify. Also, in general, these equations are infinite in number. Therefore, 
both practical and theoretical difficulties present themselves. 

The questions of existence and uniqueness of solutions are difficult and 
will not be discussed here. Suffice it to say that in almost every case of 
practical interest, Equations (2.3) and (2.4) have a unique solution that 
satisfies 



The question is discussed in some detail in Chapter XVII of Feller [1968], 
who also gives several pertinent references. 

As an example of the use of the birth-and-death process in a queueing- 
theory context, we consider a model of a queueing system with one server 
and no waiting positions. Specifically, we assume that the probability of a 
request in (t,t + h) is Xh + o(h) as h~*0, and assume that if the server is 
busy with a customer at t, the probability that the customer's service will 
end in (t,t + h) is nh + o(h) as h^>0. Assume further that every customer 
who finds the server occupied leaves the system immediately and thus has 
no effect upon it. 

In terms of the postulates for the birth-and-death process, this queueing 
model corresponds to a two-state birth-and-death process. E corresponds 
to the state {server idle}, and E x corresponds to the state (server busy). 
Since, by assumption, an arrival that occurs when the server is busy has no 
effect on the system, an arrival will cause a state transition if and only if it 
occurs when the server is idle. Therefore, the effective arrival rates are 
Xo=A and A, = for j¥=0. Similarly, no customers can complete service 
when no customers are in the system, so that ju,=0 wheny'^l, and, by 
assumption, jn, = (i. The birth-and-death equations (2.3) for these particular 
choices of the birth-and-death coefficients are 



[0<Pj(t)<U 0</<oo]. 



(2.12) 



j-0 




(2.13) 



and 




liP x {t). 



(2.14) 
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Standard techniques exist for the solution of sets of simultaneous linear 
differential equations, but we shall solve this simple set by using its special 
properties. 

First note that when Equations (2.13) and (2.14) are added, we obtain 

^[Po(0 + ^(0]-o, 

so that the sum of the probabilities is constant for all t > 0, 

J> (/) + />,(/) -c. (2.15) 

We require that the system initially be describable by a probability 
distribution, so that 

i' o (0) + P 1 (0)=l. (2.16) 
Then Equations (2.15) and (2.16) require c= 1, and hence 

P (t) + P l (t)=\ (/>0). (2.17) 
Substitution of (2.17) into (2.13) yields 

^i> o (0 + (X + M)^o(') = /*, 
which has the general solution 

By symmetry, Equations (2.17) and (2.14) yield 

Equations (2.18) and (2.19) comprise the transient solution, which de- 
scribes the system as a function of time. In most applications, interest 
centers not on the values of these probabilities at a specific point in time, 
but rather on their long-run values. This topic is discussed in the next 
section. 



23. Statistical Equilibrium 

Suppose that we are interested in the behavior of the system just described 
for large values of /, that is, after it has been in operation for a long period 
of time. The state probabilities as functions of time are given by Equations 
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(2.18) and (2.19). Letting *->oo in (2.18) and (2.19), we obtain 



P = lim P (t) = 



A + (i 



(3.1) 



and 



P,= lim P x {t) = 



A 



(3.2) 



A+jii " 



Observe that 



^0+^1=1, 



(3.3) 



so that the limiting distribution is proper. Note that the limiting values of 
the probabilities are independent of the initial values P^O) and P x (0). In 
other words, after a sufficiently long period of time the state probabilities 
are independent of the initial conditions and sum to unity; the system is 
then said to be in statistical equilibrium or, more simply, equilibrium. 

An important characteristic of the statistical-equilibrium distribution is 
that it is stationary; that is, the state probabilities do not vary with time. 
For example, if our system were assumed to be in equilibrium at some time 
t, say t = 0, so that P (0) = n) and P l (0) = \/(X + fi% then Equations 

(2.18) and (2.19) make it clear that these initial values will persist for all 
t > 0. In other words, when a system is in statistical equilibrium no net 
trends result from the statistical fluctuations. 

Another important property of a system possessing a statistical 
equilibrium distribution is that it is ergodic, which means that in each 
realization of the process, the proportion of the time interval (0,jc) that the 
system spends in state Ej converges to the equilibrium probability Pj as 



Note carefully that the concept of statistical equilibrium relates not only 
to the properties of the system itself, but also to the observer's knowledge 
of the system. In our example, if an observer were to look at the system at 
any time t, then he would find the system in either state E x or E , say E . If 
he were to look again at any later time x = t + 1', the probability that he 
would then find the system in state E x , say, is given by (2.19) with t = t' 
and .P^O) = 0; whereas the corresponding probability would be given by 
(3.2) if equilibrium had prevailed at time / and the observer had not 
looked. Thus, the value of the probability Pj(t+ 1') depends on whether or 
not an observation was made at t, even though the system itself is not 
physically affected in either case. 

Clearly, practical applications of queueing theory will be largely con- 
cerned with the statistical-equilibrium properties of a system. Therefore, it 
would be useful to be able to obtain the equilibrium distribution directly 



x— ><x>. 



Statistical Equilibrium 



21 



(when it exists), without having to find the time-dependent probabilities 
first. In our example, since the limiting probabilities lim^^ Pj(t) = Pj have 
been shown directly to exist, it follows from Equations (2.13) and (2.14) 
that \im t _ ttOO (d/dt)P J (t) = 0. This suggests that the equilibrium probabilities 
might follow directly from Equations (2.13) and (2.14) when the time 
derivatives are set equal to zero. That is, letting t-±co in (2.13) and (2.14), 
we obtain 

= V.P x -\P Q (3.4) 

and 

0=XP -[xP ] . (3.5) 
Equations (3.4) and (3.5) are identical; they yield 

^i=^o- (3.6) 

Since the initial conditions (2.16), which led to the normalization equation 
(2.17), no longer appear, we must specify that 

= (3-7) 

We then obtain from Equations (3.6) and (3.7) that P = n/(\ + n) and 
P l =\/(\ + p), in agreement with the previous results (3.1) and (3.2). Thus 
we have obtained the statistical-equilibrium solution by solving the linear 
difference equations (3.4) and (3.5) instead of the more difficult linear 
differential-difference equations (2.13) and (2.14). 

We have shown that, in this simple example at least, the statistical- 
equilibrium distribution can be obtained in two different ways: 

1. Solve the differential-difference equations (2.3), with appropriate ini- 
tial conditions, to obtain Pj(t), and then calculate the limits 
\im l _^ oa P/ < t) = P r 

2. Take limits as t->cc throughout the basic differential-difference equa- 
tions (2.3), set limpid/ dt)Pj(t)=0 and lim,^ Pft) = P p solve the 
resulting set of difference equations, and normalize so that 2jl ^y 
= 1. 

Method 2 is clearly the easier way to obtain the equilibrium distribution, 
since the problem is reduced to solving a set of difference equations 
instead of a set of differential-difference equations. 

Let us now move from this simple motivating example to the general 
birth-and-death process. One might hope that method 2 applies to the 
general birth-and-death equations (2.3). In essentially all birth-and-death 
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models of practical interest, it does. The following informal theorem, 
which we state without proof, is useful in this regard. 

Theorem. Consider a birth-and-death process with states E^E^.., and 
birth and death coefficients A y >0 O'=0, 1,...) and n>0 = 1,2,...); and 
let 

g ,l + jo + VL + ... + ^-V» + ,„ i (3g) 
Mi Hi to M,/vMj 

Then the limiting probabilities 

Pj= Mm Pj{t) 0=0,1,...) (3.9) 

t— »oo 

exist, and (for the models considered in this book) do not depend on the 
initial state of the process. If S <°°, then 



s- 1 0=0), 



Since the probabilities (3.10) sum to unity, they constitute the statistical- 
equilibrium distribution. If S= oo, then Pj = Q for all finite j, so that, roughly 
speaking, the state of the process grows without bound and no statistical- 
equilibrium distribution exists. If the process has only a finite number of 
allowed states E ,E u ...,E n , then put \, = in (3.8); in this case we shall 
always have S< oo, and hence (3.10) is the statistical-equilibrium distribu- 
tion, where, of course, Pj=0 for all j>n. 

For example, consider again the (finite-state) system described by (3.1) 
and (3.2), for which the birth coefficients and death coefficients are \q=\, 
A, =0, and Mi = M- In this case, S= l+A//i< oo, and therefore the statisti- 
cal-equilibrium distribution exists and is given by (3.10), in agreement with 
(3.1) and (3.2). 

As a second example, consider the birth-and-death process in which 
A,_ l = /i r >0 for y=l,2,... . Then S= 1 + I + • • * = oo; hence Pj = = 
0,1,...) and no equilibrium distribution exists. 

A third example is provided by the three-state birth-and-death process 
characterized by Aq = 0, A, = /*, > 0, and \ 2 = th = 0- Oar theorem is inappli- 
cable, because the condition ^ >0 (J= 1,2,...) is violated [causing S, given 
by (3.8), to be undefined]. However, it is clear from physical considerations 
that a limiting distribution does exist: If the initial state is E t and /#1, 
then Pi=\\ and if the initial state is E v then P =P 2 = i (ind, again, 
Pi—0). In other words, the system is not ergodic, because if it ever 
occupies state E it will remain there forever, and similarly for E 2 . Thus, 
the limiting distribution depends on the initial conditions and is therefore 
not a statistical-equilibrium distribution. 
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It is instructive to assume the existence of lim,^ Pj{t) — P p and then 
derive (3.10). Observe first that, by (2.3), the existence of Xim^Pp)^ P } 
for all j implies the existence of limpid / di)P ft) for ally. Consequently, 
lim,^ (d / dt)Pj(t) = Q, because any other limiting value would contradict 
the assumed existence of lim^^ P/t). Thus, taking limits throughout the 
equations (2.3) and rearranging, we have: 

(Xj + ^Pj-Xj^Pj. , + t P J+l 
= 0,1,...; A_ 1 = f * = 0). (3.11) 



When 7 = 0, Equation (3.11) is 

and when j — 1, Equation (3.1 1) is 

(Ai + Mi)^i=V > o + /VV (3-13) 
Equations (3.12) and (3.13) together yield 



Similarly, (3.14) and (3.11) for y=2 together yield X 2 P 2 = ^P^ continuing 
in this way we derive the following set of equations, which are equivalent 
to (3.11) but much simpler in form: 

Vy-^+i^+i = 0,1,...). (3.15) 

If , > fory =0, 1, . . . , then (3.15) yields 

Pj.^^Pj = 0,1,...), (3.16) 



or, equivalently, 



V.-A,- (y=I , 2 ,...). (3 .,7) 



In the cases of practical interest for our purposes, it will always be true 
that ^ +1 >0 = 0, 1,...) and either (i) Xj>0 for ally =0,1,... or (ii) X y >0 
for y = 0, 1,...,«- 1 and \,=0. In both cases, the probability P is de- 
termined formally from the normalization condition 



y-o 



P r \. 



(3.18) 
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In case (i) the sum on the left-hand side of (3.18) may fail to converge 
when (3.17) is substituted into it unless P =0. This would imply that P y =0 
for all finite j, and therefore no statistical-equilibrium distribution would 
exist. In case (ii) this sum will always converge, because it is composed of 
only a finite number of terms [because, from (3.17), i^ = for all j>ri\. 
Therefore, if the initial state is E ( and i<n, then the statistical-equilibrium 
distribution exists and is given by (3.17), with P Q calculated from (3.18). 
Thus we see that in both cases we obtain results that agree with (3.10). 

If AjX) for all j, and ty = for some j = k, then, as Equation (3.15) 
shows, Pj=*Q for 7 = 0, l,...,k— 1. An extreme example is provided by the 
pure birth process, where ty-0 for all values of the index j. Thus Pj=0 
O' = 0, 1,...,) for the pure birth process; that is, no proper equilibrium 
distribution exists. In the particular case of the Poisson process, for 
example, where A, = A (,/'=0, 1, . . .), the time-dependent probabilities {P/t)} 
are given by Equation (2.7), from which we see that these probabilities are 
all positive and sum to unity for all finite / > 0, but that each probability 
approaches zero as t— >oo. 



Exercise 



3. Consider a birth-and-death process with p. k = and ^ > when j >k, and A, > 
for all j. Show that when S<oo, 



where 



and 





\c\c+l' " ' A,- 1 



0=o,i k-\), 

(/-*). 

P k (j = k+l,k + 2,...), 



00 J ' \ 

s=i+ s n 

j = k+ \ i-k+\ Mf 

^ = = 0,1,...) when S = oo. 



Equation (3.11) admits of a simple and important intuitive interpreta- 
tion: To write the statistical- equilibrium state equations (sometimes also 
called the balance equations), simply equate the rate at which the system 
leaves state Ej to the rate at which the system enters state Ej. Similarly, 
Equation (3.15) can be interpreted as stating that the rate at which the 
system leaves state Ej for a higher state equals the rate at which the system 
leaves state E J+l for a lower state. 

Equations (3.11) and (3.15) are statements of conservation of flow. 
Recall that in Chapter 1 we gave a heuristic derivation of an important 
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queueing formula using this concept. Specifically, we appealed to this 
concept to derive the statistical-equilibrium equation (1.1) of Chapter 1: 

\Pj = (j+l)T- l P j+l = 0,1,. ..,5-1). 

Observe that this equation is the special case of the statistical-equilibrium 
state equation (3.15) with A, = X and fy +l : =(J+ 1)t -1 for j=0, 1 
and A, = for j >s, and that the heuristic conservation-of-flow argument of 
Chapter 1 is identical with the intuitive interpretation of Equation (3.15). 

Thus far our discussion has been limited to birth-and-death processes 
for which the ordering of states E ,E X ,E 2 ,... arises quite naturally. As we 
shall see, however, this is not always the case. Quite often problems arise in 
which the natural definition of states requires two variables, for example, 
Ey (/ = 0, 1,2,...; y = 0, 1,2,...). We have shown that for the one-dimen- 
sional case, the "rate out equals rate in" formulation (3.1 1) is equivalent to 
the algebraically simpler "rate up equals rate down" formulation (3.15). Of 
course, these two-dimensional states can always be relabeled so that the 
problem reduces to a one-dimensional case, but in general the result is no 
longer a birth-and-death process. Thus, the "rate up equals rate down" 
formulation, as exemplified by (3.15), is in general inapplicable in the 
multidimensional case. However, as we will show in Chapter 4, multidi- 
mensional birth-and-death models can be described by "rate out equals 
rate in" equations that are analogous to (3.11). 

To summarize, from a practical point of view, it is assumed that a 
system is in statistical equilibrium after it has been in operation long 
enough for the effects of the initial conditions to have worn off. When a 
system is in statistical equilibrium its state probabilities are constant in 
time, that is, the state distribution is stationary. This does not mean that 
the system does not fluctuate from state to state, but rather that no net 
trends result from the statistical fluctuations. The statistical-equilibrium 
state equations are obtained from the birth-and-death equations by setting 
the time derivatives of the state probabilities equal to zero, which reflects 
the idea that during statistical equilibrium the state distribution is constant 
in time. Equivalently, the statistical-equilibrium state probabilities are 
defined by the equation "rate out equals rate in," which reduces in the 
one-dimensional case to "rate up equals rate down." A system with a 
statistical-equilibrium distribution is ergodic, and hence the probability Pj 
can be interpreted as the proportion of time that the system will spend in 
state Ej (y' = 0, 1,2,...), taken over any long period of time throughout 
which statistical equilibrium prevails. 

It should be apparent that a deep understanding of the theory of 
birth-and-death processes requires extensive mathematical preparation. We 
emphasize that the present treatment is informal; the interested reader 
should consult more advanced texts on stochastic processes and Markov 
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processes, such as ginlar [1975], Cohen [1969], Feller [1968], Karlin [1968], 
and Khintchine [1969]. (Roughly speaking, a Markov process is a process 
whose future probabilistic evolution after any time / depends only on the 
state of the system at time f, and is independent of the history of the 
system prior to time t. It should be easy for the reader to satisfy himself 
that a birth-and-death process is a Markov process.) Hopefully, it is also 
apparent that the birth-and-death process has sufficient intuitive appeal so 
that useful insights can be gained from the preceding cursory summary. 
We shall return to the birth-and-death process in Chapter 3, where we shall 
develop queueing models by judiciously choosing the birth-and-death 
coefficients. 

2.4. Probability-Generating Functions 

Many of the random variables of interest in queueing theory assume only 
the integral values j = 0,1,2,.... Let K be a nonnegative integer-valued 
random variable with probability distribution {/?,}, where Pj = P{K=j] 
(y = 0, 1,2,...). Consider now the power series g(z), 

g(z)=p +p l z+p 2 z 2 +--> , (4.1) 

where the probability pj is the coefficient of z J in the expansion (4.1). Since 
{Pj} is a probability distribution, therefore g(l)= 1. In fact, g(z) is conver- 
gent for |z|< 1 [so that the function (4.1) is holomorphic at least on the 
unit disk — see any standard text in complex-variable or analytic-function 
theory]. Clearly, the right-hand side of (4.1) characterizes K, since it 
displays the whole distribution {pj}. And since the power-series repre- 
sentation of a function is unique, the distribution {pj} is completely and 
uniquely specified by the function g(z). The function g(z) is called the 
probability-generating Junction for the random variable K. The variable z 
has no inherent significance, although, as we shall see, it is sometimes 
useful to give it a physical interpretation. 

The notion of a generating function applies not only to probability 
distributions {/>,}, but to any sequence of real numbers. However, the 
generating function is a particularly powerful tool in the analysis of 
probability problems, and we shall restrict ourselves to probability-generat- 
ing functions. The generating function transforms a discrete sequence of 
numbers (the probabilities) into a function of a dummy variable, much the 
same way the Laplace transform changes a function of a particular 
variable into another function of a different variable. As with all transform 
methods, the use of generating functions not only preserves the informa- 
tion while changing its form, but also presents the information in a form 
that often simplifies manipulations and provides insight. In this section we 
summarize some important facts about probability -generating functions. 
For a more complete treatment, see Feller [1968] and Neuts [1973]. 
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As examples of probability-generating functions, we consider those for 
the Bernoulli and Poisson distributions. A random variable X has the 
Bernoulli distribution if it has two possible realizations, say and 1, 
occurring with probabilities P{X=0} = g stndP{X= \}=p, where/? + # = 
1. As discussed in Section 2.1, this scheme can be used to describe a coin 
toss, with X=\ when a head appears and X = when a tail appears. 
Referring to (4.1), we see that the Bernoulli variable X has probability- 
generating function 

g{z) = q+pz. (4.2) 

(If this seems somewhat less than profound, be content with the promise 
that this simple notion will prove extremely useful.) 

In Section 2.2 we considered the random variable N(t), defined as the 
number of customers arriving in an interval of length and we showed 
that with appropriate assumptions this random variable is described by the 
Poisson distribution 

P{N(t)=j}=^jfe-" 0=0,1,...). 
If we denote the probability-generating function of N(t) by g(z), then 

7-0 J' y-0 J' 

and this reduces to 

g ( z ) = e -Mi-*\ (43) 

Having defined the notion of probability-generating function and given 
two important examples, we now discuss the special properties of such 
functions that make the concept useful. Since the probability-generating 
function g(z) of a random variable K contains the distribution {pj} 
implicitly, it therefore contains the information specifying the moments of 
the distribution {pj}. Consider the mean E(K), 

E(K)= f,j Pj . (4.4) 
J-i 

It is easy to see that (4.4) can be obtained formally by evaluating the 
derivative (d/dz)g(z) = XJLJp J z J ~ 1 at z= 1: 



E{K) = g'{\). 



(4.5) 
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From (4.2) and (4.5) we see that the Bernoulli random variable X has 
mean 

E{X)=p, (4.6) 

and from (4.3) and (4.5) we see that the Poisson random variable N(i) has 
mean 

E{N(t))=\t. (4.7) 

Similarly, it is easy (and we leave it as an exercise) to show that the 
variance V(K) can be obtained from the probability-generating function as 

V(K) = g"{\) + g'{\)-[g'{\)] 2 . (4.8) 
This formula gives for the Bernoulli variable 

V{X)=pq (4.9) 

and for the Poisson variable 

V{N(t))=Xt. (4.10) 

Sometimes, as we shall see, it is easier to obtain the probability-generat- 
ing function than it is to obtain the whole distribution directly. In such 
cases, the formulas (4.5) and (4.8) often provide the easiest way of 
obtaining the mean and variance. The higher moments can also be calcu- 
lated in a similar manner from the probability-generating function, but the 
complexity of the formulas increases rapidly. If primary interest is in the 
moments of a distribution rather than the individual probabilities, it is 
often convenient to work directly with the moment-generating function or 
the related characteristic function, which generate moments as (4.1) gener- 
ates probabilities. These topics are covered in most texts on mathematical 
statistics, such as Fisz [1963]. 

Another important use of probability-generating functions is in the 
analysis of problems concerning sums of independent random variables. 
Suppose K= K x + K 2 , where K x and K 2 are independent, nonnegative, 
integer- valued random variables. Then P { K= k] (k = 0, 1, 2, . . . ) is given by 
the convolution 

P{K=k}= 2 P{K x =j}P{K 2 = k~j). (4.11) 
J-o 

Let K x and K 2 have generating functions g x (z) and g 2 (z), respectively: 



j-o 
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and 



2 P{K 2 =j}zK 

Then term-by-term multiplication shows that the product g x {z)g 2 (z) is 
given by 



8\(z)g 2 ( z ) = 2 



k 



2 P{K x ~j}P{K 2 -k-j) 



(4.12) 



If has generating function g(z) = ^_ P{K= k}z k , then (4.11) and 
(4.12) show that 

g{z) = g x {z)g 2 (z). (4.13) 

Thus we have the important result: The generating function of a sum of 
mutually independent random variables is equal to the product of their 
respective generating functions. 

For a more concise proof of (4.13), observe that for any random variable 
K with probability-generating function g(z), the random variable z K has 
expected value E(z K ) given by 

E{z K ) = g{z). (4.14) 

Then if K=K l + K 2 , it follows that E(z K ) = E(z K ' +K *) = E(z Kl z K >). If K x 
and K 2 are independent, then the random variables z Kl and z K * are also 
independent, and thus E(z Kl z K >) = E(z K *)E(z K *), from which (4.13) 
follows. 

Consider again the coin-tossing experiment described in Section 2.1. A 
coin is tossed n times, with Xj — \ if a head appears and Xj=0 if a tail 
appears on the jth toss. We wish to determine the probability that k heads 
appear in n tosses. Let 

S m -X l +---+X Ht (4.15) 

so that the value of the random variable S n is the number of heads 
appearing in n tosses. is the sum of n mutually independent, identically 
distributed Bernoulli variables {Xj}, each with generating function g(z) 
given by (4.2). Therefore, S n has generating function [g(z)] n ; hence 



(q+P*) m - 2 P{S n = k}z k . 

k = 



(4.16) 
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Expanding the left-hand side of (4.16) by the binomial theorem, we obtain 
(q+pzT= t ( n ){pz) k q»- k . (4.17) 
Equating coefficients of z k in (4.16) and (4.17) yields 

[0 (k>n), 

which is, of course, the binomial distribution. 

The result (4.18) could have been obtained by direct probabilistic 
reasoning. (Any particular sequence of k heads and n — k tails has proba- 
bility p k q n ~ k , and there are (£) such sequences.) In this example, the 
direct probabilistic reasoning concerning the possible outcomes of n tosses 
is replaced by the simpler probabilistic reasoning concerning the possible 
outcomes of one toss and the observation (4.15) that leads to the use of 
probability-generating functions. In a sense, probabilistic or intuitive rea- 
soning has been traded for more formal mathematical manipulation. In the 
present case both approaches are simple, but this is not always true. 
Roughly speaking, the difficulties encountered in the generating-function 
approach, when applicable, are not extremely sensitive to the underlying 
probabilistic structure. Thus the use of generating functions tends to 
simplify hard problems and complicate easy ones. 

Another example is provided by the Poisson distribution. Let N=N X + 
N 2 , where N ( is a Poisson random variable with mean \ ; that is, P{N;=j} 
= (\{/jl)e~ Xi . Then N has probability-generating function 

g( z ) = e - z) e -\ 2 (i-z) s=e -(x 1 +\ 2 Xi - z)^ (4 

Equation (4.19) shows (with no effort) that the sum of two independent 
Poisson variables with means X, and \ 2 is itself a Poisson variable with 
mean X= A, +A 2 . 

Exercise 

4. Compound distributions. Let X x , X 2 , . . . be a sequence of independent, identically 
distributed, nonnegative, integer-valued random variables with probability-gen- 
erating function f(z); and let N be a nonnegative, integer-valued random 
variable, independent of X U X 2 ,..., with probability-generating function g(z). 
Let Sff denote the sum of a random number of random variables: S N * s X l 

H +X N . (In general, the distribution of the sum of a random number of 

independent random variables is called a compound distribution.) 

a. Show that S N has probability-generating function g(f(z)). 

b. Show that E(S N ) = E(N)E(X) and V(S„) = E(N) V(X) + V(N)E 2 (X). 
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It was mentioned previously that the variable z in the generating func- 
tion has no inherent significance, but that it is sometimes useful to give it a 
probabilistic interpretation. This may allow us to obtain the generating 
function directly from probabilistic considerations and, in so doing, to 
replace tedious arguments by elegant ones. This probabilistic interpreta- 
tion, suggested by D. van Dantzig and called by him the method of 
collective marks, is: Imagine that the random variable whose probability- 
generating function we seek has been observed to have realization K=j, 
say. Now suppose we perform j Bernoulli trials, where at each trial we 
generate a "mark" with probability 1 — z (and no mark with probability z). 
Then z J is the probability that none of the j trials resulted in a mark. 
Hence, 'ZJ^qP {K=j} z J => g(z) is the probability that no marks are gener- 
ated by the realization of K. 

For example, consider Exercise 4a, where we wish to determine the 
probability-generating function of the sum S N of a random number of 
random variables. The probability that the realization of X x> say, generates 
no marks is equal to its probability-generating function f(z). Thus, the 

probability that the realization of X x -\ + X n generates no marks is 

U( Z )T' Hence, the probability that the realization of S N generates no 
marks is ^ l P{N=n}[f(z)T = g(f(z)). 

We shall appeal to the method of collective marks to eliminate messy 
calculation at various points in the text. For further discussion see Neuts 
[1973] and Runnenburg [1965] (as well as the accompanying discussion of 
Runnenburg's paper by van der Vaart). 

Exercise 

5. Let Nx and N 2 be independent, identically distributed, nonnegative integer-val- 
ued random variables. We consider two different procedures for distributing 
iV, + N 2 balls into two cells. In procedure (a), we place N { balls in the first cell 
and N 2 balls in the second cell. In procedure (b), we consider the N t + N 2 =N 
balls together, and independently place each of the N balls in a cell, with each 
ball having probability \ of being placed in the first cell and probability \ of 
being placed in the second cell. Define the generating functions 

j-0 

and show that if procedures (a) and (b) are equivalent, then 



?(%W=f 2 (x)' 
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Perhaps the most important use of probability-generating functions in 
queueing theory is the solution of probability state equations. Consider, for 
example, the birth-and-death equations (2.3) with A y =A, iij=jn (J — 
0, 1,2,...), and initial condition P (0) = 1. The equations are 

f t Pj{t) = XPj_.it) + (j + l)pP J+ ,(*) - (A +jp)Pj(t) 

[i'_ 1 (0 = 0; 7 = 0,1,...]. (4.20) 

These equations describe the following important queueing model. 
Customers request service from an infinite-server group. The probability 
that exactly one customer arrives in (t, t + h) is Ah + o{h) as h-*0. If there 
are j customers in service at time t, the probability that exactly one 
customer will complete service in (t,t + h) is jtih + o(h) as h— >0, and the 
probability that more than one change (arrivals and/or service comple- 
tions) occurs in (i, t + h) is o(h) as h—>0. (As we shall see, these assumptions 
are true if the customers arrive according to a Poisson process with rate A, 
and the service times are mutually independent, identically distributed 
exponential random variables, each with mean /* _t .) Pj(t) is the probability 
that j customers are in service at time t. 

Although this model is an important one in queueing theory, we shall 
defer discussion of it to later chapters. Our present concern' is simply the 
application of the theory of probability-generating functions to the solu- 
tion of the probability state equations (4.20). 

We emphasize that (4.20) is an infinite set of simultaneous linear 
differential-difference equations, an apparently hopeless case. Undaunted, 
we define the probability-generating function P(z,t), 

P(z,t)= f Pj(t)zJ. (4.21) 

7=0 

Now multiply each equation in (d/dt)P.(t) of the set (4.20) by z j and add 
all the equations. The result is 

oo i oo oo 

2 Tt Pj{t)z j =A__Pj- x {t)zJ+^u+m+M zJ 

j=o ai j=\ y-o 

-A f PjOW-H I JPp)z J , (4.22) 

7=0 7=1 

which in view of (4.21) can be written 

3 8 9 

— P(z, t) = \zP(z, t) + [i— P(z, t) - \P(z, t)-nz~ P(z, t). (4.23) 
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When rearranged into standard form, Equation (4.23) reads 

-| P(z, t)-(l-z)fx~~ P(z, t) = - A(l - z)P(z, f). (4.24) 

Thus we have transformed the infinite set of simultaneous linear dif- 
ferential-difference equations (4.20) for Pfi) into the single linear first- 
order partial differential equation (4.24) for the generating function P(z, t). 
It remains to solve (4.24) for P{z, i) and invert. 

Equation (4.24) can be solved in general by well-known techniques (see 
any standard text, such as Garabedian [1964]). The solution that satisfies 
i>(z,0)= 1, corresponding to the initial condition P o (0) = 1, is 



P(z, t) — exp 



-(l-e-'OO-z) 



(4.25) 



as is readily verified by substitution into (4.24). The state probabilities Pj(i) 
are easily found by recognizing (4.25) as the probability-generating func- 
tion for the Poisson distribution with mean (A/ju)(l — e"*"): 



Pj{t) = ± ~z J - exp 



0=o,i,...). 

(4.26) 



Several observations are in order. As in Section 2.2, we have solved the 
birth-and-death equations (2.3) completely for a particular choice of birth 
and death rates and a partieular initial condition. The normalization 
requirement (2.12) is satisfied, since for any /, Pj(t) is a Poisson probability. 
Equivalently, P(l,t)=l, as (4.25) shows. The statistical-equilibrium distri- 
bution can be obtained directly by taking limits in (4.26): 

P = \imPAt)=tH$-e~V» = 0,1,...). (4.27) 

The limiting distribution is the Poisson distribution with mean A/ p. 

The limiting distribution (4.27) can also be obtained by taking limits in 
the probability-generating function (4.25). In general, it is true that a 
sequence of probability distributions converges to a limiting distribution if 
and only if the corresponding generating functions converge (see Chapter 
XI.6 of Feller [1968]). 

Finally, note that (4.27) can be obtained directly from the statistical- 
equilibrium state equations and the normalization condition, that is, from 
(3.17) and (3.18) with A,.=A, Hj=jn = 0,1,2,...). 
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Exercise 

6. For the model of Exercise 2, Section 2.2, define the generating function 
JW)-2j°_oW*'. 

a. Show directly from the birth-and-death equations that P(z,t) satisfies the 
partial differential equation 

b. Verify that this equation has solution 



satisfying the initial condition N(0) = 1 [and, of course, the normalization 
condition 2^1o^(0 — 1]> an d verify that P(z,f) generates the distribution 
found in Exercise 2. 
c Verify that the solution P„(z,t) of the partial differential equation corre- 
sponding to the initial condition N(0) = n > 1 is given by P„(z,t)~ P n (z,t). 
Give a physical interpretation of this result. [It can be shown by expansion 
of P"(z,t) that for N(0) = n > 1, 

Pj(0 = ( J J Z 1 n ) e ~ nX '( l ~ e ~ X 'y~ n (j = n,n+\,...) 
and Pj(i) = § when j <n.] 

2.5. Some Important Probability Distributions 

We consider several distributions that are important in queueing theory. 



Bernoulli Distribution 

A random variable X has the Bernoulli distribution if it has two possible 
outcomes, say X= 1 and X=0 (often called success and failure, respec- 
tively) occurring with probabilities P{X= \ }—p and P{X = 0} = q, where 
p + q— 1. As mentioned previously, this scheme is often used to describe a 
coin toss, with X = 1 when a head occurs. The Bernoulli variable has 
probability-generating function q + pz, mean p> and variance pq. 



Binomial Distribution 

The binomial distribution describes the number of successes in n Bernoulli 
trials, which, by definition, are a sequence of n independent experiments, 
each of which results in either "success," with probability p, or "failure," 
with probability \ —p = q. If S n is the number of successes in n Bernoulli 
trials, then S n can be represented as the sum S n = X l + • • • +X n , where Xj 
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is the Bernoulli variable with value A} = 1 if the 7th trial results in success 
and Xj = for failure. Since the {Xj} are mutually independent, identically 
distributed random variables, each with probability-generating function 
q+pz, then S„ has generating function (q+pz)". Expansion of the generat- 
ing function shows that S n has the binomial distribution 

p{s n =k)={ n k ) P k (\- P y- k (*-o,i,...,«). (5.1) 

As mentioned previously, Equation (5.1) can be derived directly by 
probabilistic reasoning: Since the trials are mutually independent, each 
trial having probability p of success, any sequence of k successes and n — k 
failures has probability p k {\ -p)"~ k . There are 

( n \ n -± 

\k) k\(n-k)\ 

ways to choose the k locations of the successes in a sequence of k successes 
and n-k failures. Thus there are altogether (jj) different sequences of k 
successes and n — k failures, and (5.1) follows. 

The binomial distribution is often used to describe the number of busy 
telephone lines in a line group. This use of (5.1) is based on the implicit 
assumptions that each line has equal probability p of being busy, and that 
the lines become busy and idle independently of each other. These implicit 
assumptions are often not true, in which case (5.1) provides only a rough 
approximation. 

The random variable S n with distribution (5.1) has mean np and vari- 
ance npq, which follows directly from the interpretation of S n as a sum of n 
identical, mutually independent Bernoulli variables. 

Exercise 

7. Suppose S„ has the binomial distribution (5.1) with * = and S„ 2 has the 
binomial distribution (5.1) with n = n 2 . Show that if 5 , „ | and S„ 2 are indepen- 
dent, then 5„ ( + S„ 2 has the binomial distribution (5.1) with n = + n 2 . 

Multinomial Distribution 

Suppose that we have a sequence of n independent trials, but at each trial 
the number of possible outcomes is r > 2. That is, at the 7th trial there are r 
possible outcomes with respective probabilities p v ...,p r (subject, of course, 
1° Pi + ' ' ' +P r — !)• Let N i b e tne number of times the ith outcome occurs 
in n trials. The joint probability that in n trials the ith outcome occurs k t 
times, where k l + • • • + k r = n, is 

P{N l = k v ...,N=k r }= k j n ; , krl Pf l -"Pr K - (5-2) 



36 



Review of Probability and Stochastic Processes 



Equation (5.2) defines the multinomial distribution, which is a multivariate 
distribution, and for which probability-generating functions can be defined 
in a manner analogous to that presented for univariate distributions. We 
shall not go into this subject further, but simply call attention to (5.2), 
which we shall have occasion to use later. Note that (5.2) reduces to the 
binomial distribution when r = 2. 



Geometric Distribution 

In a sequence of Bernoulli trials, the probability that the first success 
occurs after exactly k failures is (1 —pfp, where p is the probability of 
success at any particular trial. Let N be the number of failures preceding 
the first success. (N is often called the waiting time to the first success.) 
Then N has the geometric distribution 

P{N=k} = (\-p) k p (* = 0,1,...). (5.3) 

Notice that N has an infinite number of possible realizations, in contrast 
with the Bernoulli, binomial, and multinomial distributions, which are 
defined on finite sample spaces. For any 0<p < 1 we have 2£_ ~P) k P ~ 
1, so that the distribution (5.3) is proper and there is no need to assign a 
positive probability to the realization N=co. 

The random variable N has probability-generating function 



2 Q k z k =-r^- 



mean q/p, and variance q/p 2 , where q—\ —p. 

From (5.3) the probability that the waiting time to the first success is at 
least k is 



P{N>k}= 2 ri>-P^-q k , ( 54 ) 
j-k 1 * 



which is just what we should expect: q* is the probability that each of the 
first k trials results in failure. 

The formula (5.3) gives the probability that exactly k failures precede 
the first success in a series of independent trials, where each trial has the 
same probability p of success. Since the probability of success at each trial 
is independent of the number of preceding trials, it should be clear that the 
conditional probability that N will equal j+ k, given that N is at least j 
(y = 0, 1,2...), is exactly the same as the unconditional probability (5.3) 
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that N equals k. The definition of conditional probability yields 



P{N=j+k\N>j} 



P{N=j + k,N>j) 
P{N>j} 




(5.5) 



or equivalently, 



P{N>j + k\N>j} = q k . 



(5.6) 



Comparison of (5.6) and (5.4) shows that if N has a geometric distribution, 
then 



This property of independence of past history, called the lack-of-memory 
or Markov property, plays an important role in queueing theory. 

The geometric distribution may be used to describe the duration of 
service of a customer. For the service time might be viewed as the waiting 
time for the first success in a sequence of Bernoulli trials, one trial 
performed per unit of time. (Failure means service continues; success 
means service is completed.) This crude model will prove to be surprisingly 
useful. 

Negative-Binomial (Pascal) Distribution 

The number N of failures preceding the first success in a sequence of 
Bernoulli trials has the geometric distribution (5.3). Let S n be the number 
of failures preceding the nth success. (S n + n is the number of trials up to 

and including the nth success.) Then S n = N l -i \-N n , where Nj (j = 

1, 2,..., n) is the number of failures occurring between the (J— l)th and jth 
successes. The {A^} are mutually independent, identically distributed ran- 
dom variables, each with the geometric distribution (5.3) and probability- 
generating function p/(\-qz). Hence, S„ has probability-generating func- 
tion [p/(l — qz)f. The distribution of S„, the total number of failures 
preceding the nth success in a sequence of Bernoulli trials, is called the 
negative-binomial or Pascal distribution. Expansion of the probability-gen- 
erating function shows that the negative-binomial distribution is given by 



P{N>j+k\N >j} = P{N>k}. 



(5.7) 





(5.8) 



When n = 1, Equation (5.8) reduces to (5.3). 
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Equation (5.8) is perhaps more easily derived through direct probabilis- 
tic reasoning. The nth success will be preceded by exactly k failures if there 
are exactly k failures and n - 1 successes in any order among the first 
n + k-l trials, and the (« + k)th trial results in success. Any such sequence 
has probability q k p"~ } p> There are ( n+k k ~ l ) ways of choosing the k failures 
out of the first n + k-l trials, and (5.8) follows. 

Since S n can be interpreted as a sum of n mutually independent, 
identically distributed geometric random variables, S„ has mean nq/p and 
variance nq/p 1 . 



Exercises 

8. Verify the parenthetical statement of part c of Exercise 6. 

9. Repeat Exercise 7, with the phrase "binomial distribution (5.1)" replaced 
everywhere with "negative-binomial distribution (5.8)." 

Uniform Distribution 

Let U be a continuous random variable that takes values in (0,0, with 
distribution function P{ U<x) - F(x), 



(x<0), 

y (0<JC</), 



1 (*>*)> 

and corresponding density function f(x) = (d/dx)F(x), 



/(*) = 



~ t (0<x<t), 



(5.9) 



(5.10) 



(x<0, x>t). 
Then U has the uniform distribution, with mean value 

E(U) = ^xf(x)dx=^ 

and variance 

[In general, the uniform distribution is defined over an arbitrary interval 
(t v t 2 ), but for simplicity we have taken ^=0. The standard definition, 
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however, uses the interval (0, 1). Of course, all these definitions are equiv- 
alent, corresponding only to changes in scale and location.] 

According to (5.9), U takes on values between zero and f, and the 
probability that U lies in any subinterval of (0,0 is proportional by l/t to 
the length of the subinterval. The random variable U is said to be uniformly 
distributed on (0, t). Since there is no tendency to prefer one point over any 
other when sampling is performed according to (5.9), a point chosen from 
a uniform distribution is often said to be chosen at random. 

The probability is x/ 1 that a point chosen at random in (0, /) will lie in a 
subinterval of length x, and the probability is 1 — (x/t) that a randomly 
chosen point will lie outside the designated subinterval. Thus, if n points 
are independently chosen at random from (0,/), the probability is (*//)*• 
[1 — (x/t)] n ~ k that k specific points will fall in the designated subinterval 
of length x and n — k specific points outside it. Since there are (£) ways of 
specifying k points out of n, then k out of n points chosen at random from 
(0, i) will fall in any prespecif ied subinterval of length x with probability 



Reference to (5.1) shows that this is the binomial probability with p = x/t. 
Similarly, suppose that (0,/) is divided into r subintervals of respective 

lengths Xj, such that x l -\ +x r =t. Choose n points at random from 

(0, t), and let N, be the number of points that fall within the ith subinterval 
(of length Xf). Then the joint distribution of the numbers of points falling 
in each subinterval is given by the multinomial probabilities (5.2), with 
Pi^xjt. 

In the special — t/r, each subinterval has the same 

probability of containing k points (k = 0, 1,2,...,«) as each other subinter- 
val. Although the numbers of points falling in each subinterval of equal 
length are in general unequal, there is no tendency for any particular 
subinterval to be favored over any other. Thus the uniform distribution has 
the properties one would desire from any distribution so named. 

Let us consider more deeply the question of points uniformly distributed 
over an interval (0,t). We have shown that when the interval (0,/) is 
divided into subintervals, the probability that a randomly chosen point will 
lie in a given subinterval is proportional to the subinterval's length; the 
probability that k of n randomly chosen points will all lie in a subinterval 
of length x is given by (5.1). Let us now turn our attention from the 
discrete distribution describing the number of points in a subinterval to the 
continuous distribution describing the distances between successive points. 
We shall find the distribution of the distance between any two consecutive 
points of n points uniformly distributed over (0, t). 

Suppose that n points £/,,..., f/„ are chosen at random from (0,/). Let 
U {k) be the kth smallest; t/ (1) < t/ (2) < •*• < U (n) . (t/ (fc) is called the kth order 
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statistic.) What is the distribution of the distance U^ k+l} — U ik) between two 
arbitrary successive points? 

We have argued previously that points distributed uniformly over an 
interval are spread evenly throughout the interval in the sense that the 
probability that a particular point will lie in any given subinterval is 
proportional to the subintervaFs length, regardless of the relative location 
of the subinterval within the interval (0, t). This suggests that the distribu- 
tion of the distance between any two successive points should be the same 
for every pair of consecutive points, even when one of the pair is an end 
point. Thus, if we take £/ (0) = and £/ (n+I) =?, then the probability 
P {U (Jl+V) — U^^x} should be independent of the index k. Hence it is 
necessary to evaluate P{U {k+l) — U ik) >x) for only one value of k. The 
evaluation for k = is particularly easy: £/ (1) will exceed x if and only if all 
n points fall in (jc, t). The probability of this occurrence for any one point 
is 1 — (x / 1), and since the points are selected independently of each other, 
the required probability is P{ U (l) >x] =[1 — (x/t)] n . Therefore, the proba- 
bility that any two consecutive points are separated by more than x when n 
points are selected at random in (0, t) is 



The result (5.11) has been obtained through intuitive reasoning. Its 
validity rests on the truth of the symmetry argument that the distribution 
of £/(*) is independent of k, even for the end points 1/^ = and 

We shall now verify (5.11) by a more mathematical argument that is of 
interest in itself. Choose n points at random in (0, t). We wish to calculate 
the probability that, of these n points, exactly one point lies in the 
differential element (&£ + exactly k- 1 points lie in (0,£), and exactly 
n~\— (k — 1) points lie in (£+x,t). The simple diagram in Figure 2.1 
shows that this particular arrangement implies £/ (Jfc+1 )— U ik) >x. 

The probability that any particular point lies in (£,£+*/$ is /(£)*/£ = 
{\/t)d£. The probability that any particular point lies in (0,£> is so 
that the probability that any particular k — 1 (independently chosen) points 
all lie in (0,£) is therefore (£/i) k ~ l . Similarly, for any particular choice of 
n-k points, the probability that all lie in (£+x,t) is {[*-(£+ *)]//}""*. 



p{u ik+l) -u (k) >x} = [i-j) n 

(t/ (0) = 0, U in+X) =t; fc=0, l,...,#t). 



(5.11) 



Figure 2.1 







k — 1 points 




n — k points 



.A. 
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Therefore the probability that, out of n points, a particular point lies in 
+ k — 1 particular points lie in (0,£), an d n — k particular points lie 
in (£ + x, i) is given by the product 

(in^Tv 

This expression represents the required probability for a particular choice 
of the points falling in each interval. Since there are n points, there are n 
different choices for the point that falls in And for each such 

choice, there are j) different choices for the k—l points falling in (0,£). 
Once these k points are specified, then the remaining n — k points falling in 
(£+*,f) are automatically specified. Thus the probability that, out of n 
points, any one lies in (£,£+ d%), any k—l he in (0,|)» and any n — k lie in 
is given by 

Finally, the value £ can be anywhere from zero to t — x, so that 

(*=l,2,...,w; t/ ( „ +1) =f). (5-12) 

We have already shown that P{ U^>x} = [1 — (jc //)]"• Evaluation of 
(5.12) for k - 1 easily shows that P{ U {2) - U w >x}=[l-(x/t)] n . It would 
now be surprising if our intuitive derivation of (5.11) were incorrect. 
Further straightforward (although complicated) calculations would show 
that the result (5.11) is indeed correct. 

Exercise 

10. (Feller [1971].) Find the distribution function of the length of the arc covering 
a fixed point jc on a circle of circumference c when the endpoints of the arc 
are independently uniformly distributed over the circumference of the circle. 
Show that the mean length of the covering arc is f c. [Observe that this latter 
result is the same as would be obtained if the point x were not fixed, but were 
itself chosen randomly. To the extent that this observation could have been 
made a priori on grounds of symmetry, our conclusion (that the mean length 
of the covering arc is |c) is intuitively obvious (because three randomly 
chosen points would "on the average" partition the circumference into three 
equal arcs).] 
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Finally, we note for future reference that if U v U 2 ,-.-,U n are indepen- 
dent and uniformly distributed on (0, /), then the joint distribution function 
of the order statistics t/ (1) < t/ (2) < • • ■ < U (n) is given by 

P{U (l) <X V .,., U (n y<X„} 

V nl /*lW *2-*l V* ( X n- X n-\ \ J - 

u,jtj n) J^'''jAt){ t )'"{ t )> (5 * 13) 

where 0<x l <x 2 < • • • < x n < t, and the summation is extended over all 
those sets of nonnegative integers {j\J 2 >'-- J n ) f° T which 

"+Jn = n and J1+J2+ • ' ' +Jk> k (*-l,2,...,it-l). 

To prove (5.13), observe that its right-hand side is simply a sum of 
multinomial probabilities, each being the probability that, out of n inde- 
pendent realizations, exactly j x values fall in the interval (0,^^, and jointly 
j 2 values fall in (jx^jCj), and so on. The sum is taken over all values of the 
indices such that, simultaneously, the number of values in (0,*,) is at least 
one (that is,y, > 1), the number of values in (0,^2) is a * least two (that is, 
J 1 + Ji ^ 2), and so on. 

Negative- Exponential Distribution 

Let X be a continuous random variable with distribution function P{X < 

x} = F(x), 

F{X) J'-^ (*><>)> (5.14) 
w |o (*<0), v } 

and with corresponding density function f(x) = (d/dx)F(x). Thus 

f(x) = l xe->" (x>0). (5.15) 

Then X has the negative- exponential distribution with mean E(X) = fi~ l 
and variance V(X) = p~ 2 . (For convenience, the adjective "negative" is 
often omitted from the name.) 

In the discussion of the geometric distribution, we showed [see Equation 
(5.4)3 that in a sequence of Bernoulli trials, each trial having probability of 
success p, the first k trials will all result in failures with probability (1 —p) k . 
That is, (1 —pf is the probability that the waiting time (number of trials) 
preceding the first success is at least k. 

The mean number of successes in k independent Bernoulli trials is kp. 
Hence, if k trials are conducted in a time interval of length x, then the 
success rate — that is, the mean number of successes per unit time, say 
ju — is 

kp 
/i= — . 

x 
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Now let />->0 (and therefore fc— »oo) in such a way that the rate ju remains 
constant. That is, we envision an experiment with an infinite number of 
trials performed in a finite length of time x, each trial taking vanishingly 
small time and each trial with vanishingly small probability of success, but 
with the mean number of successes being equal to the constant fix > 0. We 
have 



This equation shows that the geometric distribution approaches the 
negative-exponential in the limit as the trials are taken infinitesimally close 
together, each with vanishingly small probability of success, but with the 
average number of successes per unit time remaining constant. Thus, 
if X is the random variable representing the (continuous) time to the 
first success, rather than the (discrete) number of trials preceding the first 
success, then P{X >x} = e~' lx , where /a -1 is the mean time to the first 
success. 

We mentioned previously that the number of trials preceding the first 
success in a sequence of Bernoulli trials provides a crude but suggestive 
model for the description of service times. This assertion is further sup- 
ported by the fact that in the limit the geometric distribution approaches 
the negative-exponential distribution, and observation has shown the nega- 
tive-exponential distribution to provide a good statistical description of 
some service-time distributions. For example, it has long been known that 
telephone-call durations are well described by the negative-exponential 
distribution. As we shall see, it is indeed fortunate that this is so; the 
special properties of the exponential distribution greatly simplify what 
might otherwise be intractable mathematics. Because of these simplifying 
mathematical properties, and because data support its use in some im- 
portant applications such as telephony, it is common in the construction of 
queueing models to assume exponential service times. We shall now 
investigate some of these important mathematical properties. 

A continuous nonnegative random variable X is said to have the Markov 
property if for every t > and every x > 0, 



A random variable with the Markov property (5.17) is often said, for 
obvious reasons, to have no memory. 

Recall from Equation (5.6) that a random variable with a geometric 
distribution has this characteristic property of lack of memory. As pointed 
out in the discussion of (5.6), the existence of the Markov property is 
self-evident in that case. It is easy to show that the Markov property of the 




(5.16) 



k-+ oo 
pk=* fix 



P{X>t + x\X>t} = P{X>x}. 



(5.17) 
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(discrete) geometric distribution carries over to the (continuous) exponen- 
tial distribution. Using the definition of conditional probability, we have 

-K' + *) 

P{X>t + x\X>t) = =e~^. (5.18) 

e 

This equation implies, for example, that a call with duration distributed 
according to (5.14) has the same distribution of remaining holding time 
after it has been in progress for any length of time t >0 as it had initially at 
t = 0. (This does not mean, of course, that an individual call does not "age," 
but only that the statistical properties of the population of all calls are such 
that one can draw no inference about the future duration of an arbitrary 
call from this population based on only its past duration.) 

Although at first glance one might find the property (5.18) to have little 
intuitive appeal, it should immediately be stocked in one's inventory of 
intuition, since the simplifications to which it leads are enormous. 

Consider, for example, a two-server system with one waiting position, 
and suppose that at some time t there are three customers in the system. If 
the waiting customer waits as long as required for a server to become idle, 
what is the probability that the waiting customer will be the last of the 
three to complete service? 

We assume that service times are mutually independent random vari- 
ables, each with the negative- exponential distribution (5.14). The waiting 
customer will complete service last if and only if his service time exceeds 
the remaining service time of the customer still in service when the first 
customer leaves. But because of the Markov property, this remaining time 
is independent of the previously elapsed time. Therefore, the last two 
customers have the same distribution (exponential) of time left in the 
system; that is, there is no bias in favor of either of the customers, so that 
the required probability is \ . 

The problem was solved without knowledge of the instants at which the 
various customers seized the servers. Even the mean service time was not 
used in the calculation. It should be clear that without the assumption of 
negative -exponential service times this problem would have been much 
more difficult. 

We now turn to the relationship between the exponential distribution 
and the birth-and-death process. Let a random variable X represent the 
duration of some process, and assume that X has the exponential distribu- 
tion function given by (5.14). Suppose that at some time / the process is 
uncompleted. (For example, if X is the duration of a telephone call, then 
we suppose the call to be in progress at time t) Then the Markov property 
(5.18) implies that the process will complete in (t,t + h) with probability 

l_ e -^ =1 _/ 1 _ J ^ + i^)!_ + ... \ = [ih + ( h ) (*_>(>). 
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Conversely, suppose that if a process whose duration is given by a 
continuous random variable X is still in progress at time t, then the 
probability of completion in (t, t + h) is \)h + o(h) as h-+Q for all t > 0. That 
is, assume that 

P{X>t + h\X>t} = l-tih + o(h) (t>0, h^O). (5.19) 
Then, using the definition of conditional probability in the form 

in (5.19) rearranging, and letting h^>0 y we obtain the differential equation 

^P{X>t} = -tiP{X>t}, 

which has the unique solution 

P{X>t} = e-> lt 

satisfying P{X >0} = 1. Thus X is exponentially distributed. 

Finally, note that the Markov property P{X >x+y\X >x) - P{X >y) 
and the definition of conditional probability, 

together give the functional equation 

P I X > x + y } = P { X > x } P { X >y } . 

It can be shown that the Markov property, as expressed by this functional 
equation, uniquely characterizes the negative-exponential distribution. 
More precisely, it can be shown (see, for example, p. 459 of Feller [1968]) 
that if u{x) is defined for x > and bounded in some interval, then the 
only nontrivial solution of the functional equation 

u(x+y) = u(x)u(y) (5.20) 

is u(x) = e~' vc for some constant jn. It follows from this theorem (which is 
stronger than necessary for our conclusion) that if a continuous random 
variable has the Markov property, then it must be exponentially distrib- 
uted. 

Suppose now that X v X 2 , .-,X n are independent, identical, exponential 
random variables, and consider the corresponding order statistics ^ ( i)< 
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X (2) < ••• <X (n) . Observe that X in ~mxa.(X v X 2 ,...,X ll ) will exceed x if and 
only if all X t (/= 1,2,...,«) exceed x. Hence P{X (l) >x] = (e "'")'' = e""' ur . 
We have proved the important fact: 

P {min^X,, . . .,X„) >x) = e-"^; (5.21) 

that is, the distribution of the minimum of n independently chosen values 
from a negative-exponential distribution is again a negative-exponential 
distribution, with mean 1/w times the mean of the original distribution. 

For the sake of example, let us identify the random variables X v ...,X n 
with the durations of n independent simultaneously running time intervals, 
so that X^ is the duration of the jth shortest interval. We have just shown 
that the distribution of the duration of the shortest of the n time intervals is 
exponential with mean («ju) _1 ; that is, P{X^>x} = e~ nilx . At the point of 
completion of the shortest interval, the distribution of remaining duration 
for each of the n — 1 uncompleted intervals is, by the Markov property, 
unchanged. Hence, P{X (2) - X w >y} = e~ in ~ X)(iy . The general rule should 
now be obvious. 

Suppose that we are given a number of time intervals whose durations 
X v ...,X n are identically distributed, mutually independent, exponential 
random variables with common mean ju, -1 , and all durations are longer 
than x. We seek the probability that exactly j of these intervals are longer 
than x + 1. Because of the Markov property, the value of x is irrelevant. 
Therefore, let us take time x as the origin: x — 0. 

Now let N(t) be the number of these intervals still uncompleted at time 
t, and let P {N(t)= j] = Pjit). Since the rth (unordered) interval has dura- 
tion X t with the exponential distribution function (5.14), the probability 
that any particular one of these intervals is still uncompleted after an 
elapsed time / is 

P{X i >t}^e-' u (i-l,2 w). 

Since the {X ( } are also mutually independent, it follows that the distribu- 
tion {Pj(t)) is the binomial distribution, 

Pj{t) = {]Ye-^m-e-n n ~ J = 0,1,. ..,«). (5.22) 

This problem can also be viewed in the framework of the pure death 
process. If the system is in state Ej at time t, then the same reasoning that 
led to (5.22) shows that the conditional probability of transition E^Ej in 
(/, t + h) is given by 



P{N(t + h)= j\N(t) =j} = e - Jtth =\- j)ih + o(h) (h-+0). 
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Similarly, the conditional probability of transition E J .^E J _ l in (t,t + h) is 
given by 

P{N{t + h)=j-\\N{t)=j}=je~^-^{\-e^^) 
=jlih + o{h) (/i->0). 

Since we must have 

2 P{N(t + h) = i\N(t)=j} = l, 

i = 

it follows that 

P{N(t + h) <j — 2\N(t) =j } = o(h) (h-+0). 

Thus we have shown that N(t) obeys the postulates of the pure death 
process with death coefficients fy=jti (7=0, l,...,w) and initial condition 
N(0) = n. This is precisely the case we considered in Section 2.2 as an 
example of the pure death process. The corresponding equations are given 
by (2.9) and (2.10), and the solution by (2.11). The solution (2.11) is 
identical, of course, with that just obtained by more direct reasoning, the 
binomial distribution (5.22). [This should not be surprising, since we 
implicitly assumed (5.22) when we specified the transition probabilities.] 

We have thus also shown that if N(t) obeys the postulates of the pure 
death process with ju, = jfi, then the aggregate death process is the same as 
if it were composed of j mutually independent processes whose durations 
are identically distributed negative-exponential random variables each with 
mean n~ l . (But it does not necessarily follow that the aggregate process is 
in fact such a superposition of independent exponential processes.) 

We can calculate from (5.22) the distribution function of the length of 
time required for all n simultaneously running time intervals to complete. 
The time required for all the intervals to complete is the time required for 
N(t) to reach zero; that is, {X (n) <t}o{N(t) = 0). Hence P{X {n) <t} = 
^o(0 = (I ~e~ **')". Since X {n) = max(Xj , X 2 , . . . , X n ), we have proved 

P{max(X 1 ,X 2 ,...,J*;)<x}=(l-e-' t *) n ; 

this result should be compared with (5.21). [Note that max^j,^,...,^) is 
not exponentially distributed.] 

To calculate jE^A^) without performing any integration, let Yj be the 
elapsed time between the yth and the (J — l)th completions, 



7,. = C/=l,2,...,«; X i0) = 0). 
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Then X (n) = Y l + Y 2 H \-Y n , and since Yj is exponentially distributed 

with mean [(«— j+ l)ju] _1 , it follows that 

E(X w )~ — + f * + •--+-. 

w n/i (n- l)ju, ju 



[Thus we have deduced that 



For example, consider a queueing system with n customers in service 
and no customers waiting at time zero, and assume that no new customers 
are allowed into the system. If the service times are independently ex- 
ponentially distributed according to (5.14), then we let N(i) be the number 
of servers busy at time and X (n) be the length of time until all servers 
become idle. 

In another common application, reliability theory, a device depends on n 
independent components, each assumed to have a length of life distributed 
according to (5. 14). The device continues to operate as long as at least one 
of the n components is functioning. Then we can let N(t) be the number of 
components still functioning at time t, and X {n) be the time at which the 
device fails. 

Finally, suppose that X x and X 2 are independent nonnegative random 
variables with distribution functions F x (t) and F 2 (t). Then the probability 
that X 2 exceeds X Y is 

P{X 2 >X X } = r[l-F 2 (t)]dFM* 

If we assume further that X x and X 2 are exponential with means jif' and 
jUj -1 , then the integral on the right-hand side becomes 



We conclude that 

P{X,<X 2 } = -^-; (5.23) 

lit T lb, 



*We are using here the notation of the Riemann-Stieltjes integral, which will be discussed 
in Chapter 5. For the present, the reader should simply interpret the Riemann-Stieltjes 
integral J g(t)dh{i) as the ordinary Riemann integral with dh{t) = h'{t)dt; that is, / g(r)<ft(f) 
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that is, the probability that one exponential variable "wins a race" with 
another equals the ratio of the winner's rate to the sum of the two rates. 



Exercises 



11. Let X x ,...,X n (n>2) be mutually independent, identically distributed, ex- 
ponential random variables. Define the range R=*max(X x ,X 2 , ...,X n ) — 
mm(X x ,X 2 ,...,X„). Show, with no calculation, that the distribution function 
P{R <x} of the range is given by P{R<x}=(l-e~ ttx ) n ~ l . 

12. Let X u ...,X n be a sequence of mutually independent random variables, and 

let Xj be exponentially distributed with mean 070" Let S„ — X x -\ +X n . 

Show that S n has distribution function P{S n </}-(l-e"' tf )". [Hint: No 
calculation is necessary; compare with the maximum of n exponential vari- 
ables.] 

13. In reliability theory the failure rate function r(t) is defined such that r(t)dt can 
be interpreted as the probability that a component that is still operational at 
time / will fail in (t,t + dt). Show that if /(/) is the density function of 
component lifetimes, with corresponding distribution function F{t), then 

K) l-F(t) 

Conclude that if component lifetimes are exponentially distributed, then the 
failure rate is constant. 

14. Let X x ,X 2 ,...,X n be independent exponential random variables with means 
Hi 1 ,n 2 ~ l ,...,fi^' 1 . Show that 

P {X i = mm(X 1 ,X 2 ,...,X„)}= ■ , * , ■ (/- U,...,«). 



15. Let X x and X 2 be independent exponential random variables. Show that 

P {TDm(X x ,X 2 )>t\X x <X 2 ) - P{mm(X lf X 2 ) >t) 
or, equivalently, 

P{X, >t\X i = min(X l ,X 2 )} = P {min(^„^ 2 ) >t } (i = 1,2) 
[even when E{X x )^E(Xd[. 

16. At t — a customer (the test customer) places a request for service and finds all 
s servers busy and j other customers waiting for service. All customers wait as 
long as necessary for service, waiting customers are served in order of arrival, 
and no new requests for service are permitted after f = 0. Service times are 
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assumed to be mutually independent, identically distributed, exponential 
random variables, each with mean duration n~ l . 

a. Let X x be the elapsed time from t =0 until the customer at the head of the 
queue enters service, and let X t (/ = 2,3,...J + 1) be the length of time that 
the z'th customer spends at the head of the queue. (X J+X is the test 
customer's time spent at the head of the queue.) Show that X X ,X 2 ,...,X J+X 
are mutually independent exponential random variables, each with mean 

(*/»)" ! - 

b. Find the expected length of time the test customer spends waiting for 
service in the queue. 

c. Find the expected length of time from the arrival of the test customer at 
t = until the system becomes completely empty (all customers complete 
service). 

d. Let X be the order of completion of service of the test customer; that is, 
X=m if the test customer is the mth customer to complete service after 
/ = 0. Find P{X=m) (m = \,2,.„,s+j+l). 

e. Find the probability that the test customer completes service before the 
customer immediately ahead of him in the queue. 

Poisson Distribution 

Thus far we have discussed the exponential distribution mainly with 
respect to its use as a description of service times. But just as it may be 
useful as a description of durations of service, likewise it may be useful as 
a description of durations of elapsed time between customer arrival 
epochs. More generally, we consider points on a line (or customers arriving 
in time), and we make the assumption that the distances between succes- 
sive points are independently, identically, exponentially distributed. That 
is, we label these points T V T 2 ,..., and assume that the random variable 
Xj ■= Tj — Tj_ , (J — 1, 2, . . . ; r = 0) has the exponential distribution 



We ask the question: If the distances (lengths of time) X j =T j —T j _ x 
between successive points are independently distributed according to 
(5.24), what is the distribution of the number of points occurring in a fixed 
interval of length ft 

Let N(t) be the number of points occurring in (0,/), with P{N(t)=j} = 
Pj(t). [More formally, we say that {N(t% t>0] is a counting process,] Let 
the lengths of time between successive points be independent and have the 
exponential distribution (5.24). Then the probability that at least one point 
will occur in (t,t + h) is 1 — e~ M =)Ji + o(h) as h—>0. And the probability 
that exactly one point will occur in (/, t + h) is 



P{Xj>t}-e 



-Xt 



= 1,2,...'). 



(5.24) 




o 
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Thus the probability of occurrence of two or more points in (t,t + h) is 
l-e~ Xh -\he- Xh = o{h) as /z->0. 

Hence, N(t) satisfies the postulates of the pure birth process with A y = A 
(J — 0, 1,2,...). This case has already been discussed in Section 2.2. The 
problem formulation is given by (2.5) and (2.6), and the solution by (2.7): 

Pj(t)=Qffe-*< = 0,1,...). (5.25) 

The probabilities given by (5.25) constitute the ubiquitous Poisson distribu- 
tion. N(t) has probability -generating function g(z) given by (4.3), 

g(z) = e- x « l -*\ (5.26) 

with mean and variance given by (4.7) and (4.10): 

E(N(t))=V(N(t)) = \t. (5.27) 

Because of the Markov property of the exponential distribution, it is not 
necessary that counting start at any special point T o =0; the formula (5.25) 
gives the distribution of the number of points occurring in any interval or 
set of disjoint subintervals of total length t. 

Conversely, when the distribution of the number of points occurring in 
any interval of length / (regardless of the number of points occurring in 
any preceding interval) is given by the Poisson distribution (5.25), then the 
distances between successive points are mutually independent, identically 
distributed, exponential variables, with common mean A -1 , Setting 7* =0 in 
(5.25) shows that the distance to the first point is exponentially distributed. 
Suppose that the first point is located at T x = t. Since (5.25) is assumed to 
hold for any interval of length /, then the probability of j points in (t, r+t) 
is again given by (5.25). Thus the distance between the first point and the 
second point is independent of and has the 'same distribution as the 
distance to the first point. Repeated use of this reasoning proves the 
assertion. 

Thus Equations (5.24) and (5.25) are equivalent: In a pure birth process 
with parameter A (called a Poisson process with rate A), the number of 
points occurring in a fixed interval of length / has the Poisson distribution 
(5.25), or equivalently, the lengths of the intervals separating successive 
points are independent and have identical, negative-exponential distribu- 
tions (5.24). 

Exercise 

17. Suppose customers arrive at instants T lt T 2 ,..., where the interarrival times 
are independent and have common distribution function P{Tj— T J _ l <x} = 
G( x ) C/ = l»2,...; T o =0). (This is called recurrent or renewal input.) Let Pj(t) 
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be the probability of j arrivals in an interval of length /. Use the fact that 



to prove again: When customer interarrival times are mutually independent 
random variables, each with the negative-exponential distribution with mean 
interarrival time \~\ then the number of customers arriving in a fixed 
interval of length t has the Poisson distribution with mean \/. 

The Poisson process is often used as a description of the input process in 
queueing systems. That is, (5.24) is assumed to describe the customer 
interarrival-time distribution, and (5.25) to describe the distribution of the 
number of customers arriving in a fixed interval of length /. An input 
process obeying (5.24) and (5.25) is called, naturally enough, Poisson input. 
Because of its mathematical properties, the assumption of Poisson input in 
the analysis of queueing systems often leads to the easiest mathematics. 
Specifically, we have, as a consequence of the Markov property, the 
following important facts: 

With Poisson input at rate \, the distribution of the length of time from 
an arbitrary instant until the next arrival epoch is the same as the 
distribution of the length of time between successive arrival epochs, that is, 
exponential with mean X -1 . Similarly, looking backward from an arbitrary 
instant, the distribution of the length of time separating an arbitrary 
instant and the preceding arrival epoch is also exponentially distributed 
with mean A ~ 1 (assuming, of course, that the process is not measured from 
a finite origin). 

We now use the Markov property to illustrate a point that may be 
confusing to the beginner. Suppose that, at an arbitrary instant, one begins 
to observe a Poisson process, and asks for the mean length of time that will 
elapse until the next arrival epoch. According to the Markov property, the 
distribution of the length of time preceding the next arrival is the same as 
the interarrival-time distribution. Therefore, the observer will wait a mean 
length of time X ~ 1 for the occurrence of the next arrival. But the observer's 
waiting time is, in general, less than the time between the two bracketing 
arrival epochs (see Figure 2.2). Hence the mean time separating the 




with 



P (t) = l-G(t) 



Figure 2.2 



Observer's sampling 
point 
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bracketing arrival epochs has mean value larger than A -1 , in apparent 
contradiction to the assumption that X -1 is the mean duration of an 
arbitrary interarrival interval. 

Of course, there is no contradiction. The answer lies in the realization 
that the interval during which the observer samples is not an arbitrary 
interval. A little reflection should convince the reader that an observer 
sampling at arbitrary instants is more likely to sample during a long 
interarrival interval than a short one. Thus the act of observing introduces 
a bias; the mean length of the sampling interval exceeds that of the 
arbitrary interarrival interval. This phenomenon, the biasedness of the 
sampling interval, is true in general, not just for Poisson input. We have 
mentioned it here because the use of the Markov property makes the 
existence of the phenomenon clear. We shall discuss the general problem 
in some detail in Chapter 5. 

The geometric distribution describes the waiting time preceding the first 
success in a sequence of Bernoulli trials; the binomial distribution de- 
scribes the number of successes in a fixed number of Bernoulli trials. 
Similarly, the exponential distribution describes a length of time preceding 
an event, while the Poisson distribution describes the number of such 
events occurring during a fixed time interval. We have shown [see Equa- 
tion (5.16)] that the geometric distribution function approaches the nega- 
tive exponential in the limit as the trials are taken infinitely close together, 
each with vanishingly small probability of success, but with the average 
number of successes per unit time remaining constant. We should not be 
surprised, then, that a similar limiting process produces the Poisson distri- 
bution from the binomial: 

lim = 0,1,...). (5.28) 

n— *co 
np~Xx 

Arrivals occurring according to a Poisson process are often said to occur 
at random. This is because the probability of arrival of a customer in a 
small interval of length h is proportional to the length h, and is indepen- 
dent of the amount of elapsed time from the arrival epoch of the last 
customer. That is, when customers are arriving according to a Poisson 
process, a customer is as likely to arrive at one instant as any other, 
regardless of the instants at which the other customers arrive. This is 
similar to the property noted previously for points chosen independently 
from a uniform distribution on (0, t). However, in that case, the length t of 
the interval containing the points is known. One might conjecture, there- 
fore, that when it is known that a given number n of arrivals generated by 
a Poisson process have occurred in an interval (0, t), these arrival epochs 
are (conditionally) uniformly distributed throughout the interval. This is 
indeed true. 
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More precisely, we shall now prove that if the number N(t) of Poisson 
arrivals in (0,f) is n, then the joint distribution of the arrival epochs 
T v T 2 ,...,T n is the same as the joint distribution of the coordinates 
arranged in increasing order of n independent points, each of which is 
uniformly distributed over the interval (0, t). 

Let 0<jc, <x 2 < • • - <x n <t. Then, using the definition of conditional 
probability, 

P{T l <x i ,T 2 <x 2 ,...,T n <x ft \N(t) = n} 

_ P{T l <x l> T 2 <x 2 ,...,T„<x„,N(t) = n} 
P{N(t) = n] 

we can write, with x = 0, 



P { F, <x„ T 2 <x 2 , . . . , T n <x n \N(t) = «} 



s n 

{JlJ2,-- Jn} »-l 



(A/)" „_ 



rt! 



where the summation is extended over all those sets of nonnegative 
integers {j x ,j 2 , ...J H } for which j\ +j 2 + • • • +j n - n and 7, +y 2 + • • • > 
A: (A:= 1,2,...,«— 1), Comparison with Equation (5.13) shows that our 
assertion is proved. 

As an application of this result, let us return to the observer who 
samples a Poisson process at an arbitrary time, say t, and finds that the 
mean time from t until the next arrival epoch is the mean interarrival time. 
We consider the problem in more depth: Let T = 0, T ly T 2 ,... be the arrival 
epochs of the Poisson process, and define the forward recurrence time R t as 
the time duration from t until the next arrival epoch; define the backward 
recurrence time or age A, as the time duration between t and the last arrival 
epoch prior to t; and define the covering interval I t = A t + R r I t is thus the 
duration of the interarrival interval that contains (covers) the sampling 
point t. We know from the Markov property that R t is exponential, that is, 

P{R t >x} = e~ Xx ; (5.30) 

and further, A t and R, are independent. 
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Now consider A r It follows from the theorem of total probability that 
for t>x, 

P{A t >x} = 2 P{t-T n >x\N(t) = n}P{N(t) = n}. 
«-o 

By assumption 

and also, since T = and t >x, we have P{t— T >x\N(t) = 0} = 1. There- 
fore, 

P{A t >x)-e-»+ f P[t-T H >x\N(t)-n}&fe-» (5.31) 

From (5.29) we know that, conditional on the event N(t) = «, the joint 
distribution of T Xi T 1 ,... i T n is exactly the same as that which we would 
obtain if n values were chosen independently from a uniform distribution 
over (0,0- Therefore, we can apply (5.11). In particular, we have P{t—T n 
>x\N(t)= n} = P{U (n+l) - U w >x} where U (n+1) =t; thus 

P{t-T n >x\N(t) = n} = (l-^) n . (5.32) 

Insertion of (5.32) into (5.31) easily yields P{A t >x) = e~** (when x<t). 
Hence, we conclude that 

[0 {x>t), 

and therefore, 

lim P{A t >x}=e~ Xx . (5.34) 

Equation (5.33) could have been obtained with no calculation, simply by 
observing that A t >x if and only if no arrival epochs occur in the interval 
(t-x,t). Equation (5.34) has already been anticipated (at the start of this 
discussion, in the second paragraph following Exercise 17) on intuitive 
grounds. 

Thus we have shown that 

lim P{A t >x} = P{R t >x} ( = (5.35) 
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for the Poisson process. We shall later show that 

lim P{A t >x}= lim P{R ( >x} (5.36) 

f— >oo /— »oo 

for more general arrival processes (that do not possess the Markov prop- 
erty), a result that one might anticipate on grounds of symmetry. 

Let us now consider the covering interval I ( = A t + R r We leave it to 
Exercise 18 for the reader to show that 

P{I t <y} = \-e- } *-\ min(j>, t) e (5.37) 

and therefore 

lim P{I t <y} = l-e-** , -\ye-*. (5.38) 

t—fOO 

Observe that according to (5.37) and (5.38), the interarrival interval 7, that 
covers the sampling point does not have the same distribution as does an 
arbitrary interarrival interval. This is in agreement with the observation of 
the "paradox" that motivated the above analysis. 

Exercises 

18. Prove Equation (5.37). 

19. Let F(x,y) be the limiting joint distribution function of the lengths of the 
forward recurrence time and the covering interval, that is, F{x,y) = 
hm,_ >00 P{/? ( <A:, I t <y). We shall show later (without invoking the Markov 
property) that for a Poisson process 

F(x,y) = l-e- Xx -Xxe~ Xy (0<x<y). 

Using this result, 

a. Prove Equation (5.38). 

b. Prove that )im l _> a0 P{R t >x} = e~ x *. [This result is weaker than (5.30), 
which we proved using the Markov property.] 

c Prove that Hm,^ 0O i > {/?, >x, A, >y} = e~ Xx e~ Xy . 

d. Prove Equation (5.34) and thereby show that 

lim P{R t >x,A t >y} = lim P {R t >x}P {A t >y}. 

t—*ao f— >oo 

(This proves independence in the limit; in the text we used the Markov 
property to prove the stronger result that A, and R t are independent for 
any t.) 

e. Prove that, in the limit as f-»oo, the sampling point t is uniformly 
distributed throughout the covering interval; that is, \\m t _ ¥aa P{R t < 1 x\I t ^ 
y)-x/y. 
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Let us now consider a property of the Poisson process that turns out to 
be one of the most important in the context of queueing theory. Consider a 
"system" with states that occur according to some distribution {Pj(t)}; 
that is, Pj(t) is the probability that, at time t, the system is in state Ej. 
Stated differently, Pj(t) is the probability that an outside observer who 
observes the system at time t will find the system in state Ej. Now consider 
the state distribution as seen by the arriving customers. That is, if T V T 2 ,... 
are the instants at which customers arrive, let J7,(0 be the probability that 
the system is in state Ej just prior to time t, where / is now an arrival 
epoch. {15(0} is tnus tne distribution that represents the viewpoint of the 
arriving customer at time /. In general, these two distributions are different: 
nj{t)¥*Pj{t). However, if the arrival epochs T X ,T 2 >... follow a Poisson 
process, then 1^(0 = ^(0 f° r ali f>0 an( * ally = 0,1, — In other words, 
when the input process is Poisson, the arriving customer's distribution 
{Tljit)} and the outside observer's distribution {Pj(t)} are equal: 

nj(t) = Pj{t) (t>0; y = 0,l,...). (5.39) 

For the reader who does not think that the equality (5.39) is remarkable, 
we give the following counterexample. Suppose that a single server is 
reserved for the exclusive use of a single customer, who uses the server 
intermittently. Let E represent the state "server idle" and E x denote the 
state "server busy." Then if at time t the customer makes a request for 
service, he will necessarily find the server idle; hence, IT (0 = L However, 
if an outside observer looks at the server at time t, he may find that the 
server is busy serving the customer; hence P (t)< 1. Thus, II (t)¥=P (t) in 
this example. 

To prove (5.39) we need the following result: If T v T 2 ,... are the arrival 
epochs in a Poisson process and #(0 is the number of arrivals in (0,t% 
then 

P{T l <x 1 ,...,T n <x n \T n+l = t} = P{T l <x l ,...,T n <x n \N(t)=n}. 

(5.40) 

Before proving (5.40), we show that (5.40) implies (5.39): Equation (5.40) 
says that the joint distribution function of the arrival epochs T l} T 2 ,...,T n 
preceding the point / is the same whether t is an arrival epoch (T n+l = f) or 
an arbitrary point. Now the state of any system at any epoch t is 
completely determined by the sequence of arrival times prior to / and the 
realizations of other random variables, such as service times, that do not 
depend on the arrival process. Therefore, when (5.40) is true, the system 
looks the same to arrivals as it does to an outside observer; that is, for any 
system with Poisson arrivals, Equation (5.39) is true. Note that (5.39) holds 
without regard to whether all, some, or none of the customers in the 
Poisson arrival stream join the system (thereby causing a change of state). 
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We shall give further discussion and an alternate proof of (5.39) in 
Chapter 3. To complete the present proof, it remains only to prove (5.40). 
An expression for the right-hand side of (5.40) is given in (5.29)f namely, 

P{T l <x l ,...,T n <x n \N(t) = n} 



2 n 



ft I [H x i x i-i)Y' e ~\( Xt ~ Xi _ t) i -\(/-*„) 



{hj2,...Jn} »="l { Jjl , 



where x = and the summation is extended over all those sets of nonnega- 
tive integers . . ,j„] for which j x + • • • +j„ = n and + +j k >k 
(A:=l,2,...,«-1). 

Now consider the probability on the left-hand side of (5.40). If we agree 
that 

P{T 1 <x 1 ,...,T n <x n \T n+l = t} 

= limP{T 1 <x l ,...,T n <x n \t<T fi + l <t + h), 

h— *0 

then it follows from the definition of conditional probability that 



P{T l <x l ,...,T n <x„\T n + l = t} 

= Um P{T l <x 1 ,...,T n <x„,t<T n+l *t + h} 
h-*o P{t<T n+l <t + h} 

Since the arrival epochs T lt T 2 ,... follow a Poisson process, the denomina- 
tor in (5.42) can be written 

p{t<T n+l <t+h} = ^-e- Xl [\h+o(h)] (*-»o); (5-43) 

and the numerator in (5.42) can be written 
P{T l <x l> ...,T n <x n ,t<T n+l <t+h} 

Xe'^-^lU + oih)] (h-*0). (5.44) 
Substitution of (5.43) and (5.44) into (5.42) yields the right-hand side of 
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(5.41); Equation (5.40) has been established, and consequently, so has 
(5.39). 

Exercise 

20. A bus shuttles back and forth between two stations. At each station the bus 
unloads and all passengers waiting at the station board the bus, which is 
assumed to be of sufficient capacity so that all passengers can board. Let Pj 
be the probability that j passengers board the bus at a station; that is, Pj 
(7 — 0, 1,2,...) is the distribution of the number of passengers on the bus from 
the viewpoint of an outside observer (the bus driver, say). Let IIj be the 
probability that an arbitrary passenger shares the bus with j other passengers; 
that is, IIj (7 = 0, 1,2,...) is the distribution of the number of (other) 
passengers on the bus from the viewpoint of one of the passengers. Show that 
Pj = TIj (that is, the bus driver's viewpoint and the passenger's viewpoint are 
equivalent) if and only if Pj = (a j /j\)e~ a = 0, 1,2,...). [The "if" part of the 
statement provides a discrete analogue of (5.39).] 

Finally, we consider the questions of superposition an4 decomposition of 
a Poisson process. We have already shown [see Equation (4.19)] that if 
iV 1 (/),...,A' n (/) are mutually independent random variables, and JV)(f) has 
the Poisson distribution with mean \t (i=\,2,...,n), then the sum N^t) 

+ ••• +N n (t) also has the Poisson distribution, with mean A^H +\t- 

This implies that the superposition of n streams of Poisson arrivals is again 
a Poisson stream with mean equal to the sum of the means of the 
component streams. 

The question naturally arises: Can a Poisson process conversely be 
viewed as the superposition of any number of mutually independent 
component Poisson processes? That is, can a Poisson stream be decom- 
posed into any number of independent Poisson streams? The answer is yes, 
as we now show. 

Suppose for example, that telephone calls arrive according to a Poisson 
process with parameter A at a device that distributes the calls to n 
machines for processing. If each arrival is directed to machine j randomly 

with probability A y /A, where A=A,H \-\, then the probability that a 

call will arrive at machine j in (t,t + h) is (\j/X)[\h + o(h)]=\jh + o(h) as 
A-»0; and likewise the probability of two or more arrivals at machine j is 
o(h) as /t-»0. Hence machine j sees arrivals occurring according to a 
Poisson process with parameter A y . And because of the Markov property, 
the arrival processes at the different machines are mutually independent. 

Let us consider more formally the question for the case of decomposi- 
tion into two streams. Let {N(t) t t > 0} be the original Poisson process with 
rate A; and suppose that whenever this process generates an arrival, with 
probability A, /A (/ = 1,2) that arrival is counted as having been generated 
by the process {N^t), t>0}, independently of the assignments of all other 
arrivals. (Thus, a sequence of assignments is a realization of a sequence of 
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Bernoulli trials.) We now show that 

P{NM -j, NM = *} = M (5.45) 

J . w. 

from which it follows that {N^t), t > 0} and {N 2 (t), t>0) are independent 
Poisson processes. (The extension to include an arbitrary number of 
processes is straightforward.) 
Now we can write 

P{N l (t)=j,N 2 (t) = k} 

- P {N,{t) =j, N 2 (t) = k\N(t) =j + k}P{N{t) =j + k), (5.46) 
and, by assumption, 

P{N(t)=J + k} = ^^e-*. (5.47) 

Since the assignments of arrivals from {N(t), t>0} to be counted as 
belonging to {N ( (t), t>0] constitute a sequence of Bernoulli trials, it 
follows that the first probability on the right-hand side of (5.46) is the 
binomial: 

P{iV I (0-y^2(0-*|^(0-y + *}-( y ^)(^) / (^)*. (5-48) 

Insertion of (5.47) and (5.48) into (5.46) yields (5.45). 

We conclude: A Poisson process can be decomposed into any number of 
mutually independent component Poisson processes. A process with this 
property is called infinitely divisible (see Feller [1968]). 

The fact that the processes {N^t), t > 0} and {N 2 (t), t>0) are indepen- 
dent of each other may seem surprising. Suppose, for example, that X = 100 
customers per hour, and \ t = A 2 . One might argue that if during an hour 99 
customers have been observed to have been assigned to stream 1, then one 
would expect only a small number of customers to have been assigned to 
stream 2 (because there will be about 100 customers generated on the 
average, and 99 of them have been assigned to stream 1). On the other 
hand, one might argue that the number assigned to stream 2 should be 
relatively large (because a large number have been assigned to stream 1, 
and each stream receives an equal number of customers on the average). 
Since one can argue with equal force for either of two opposite conclu- 
sions, neither should be believed without further analysis. In this case, 
"further analysis" shows that the correct conclusion lies precisely midway 
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between these two extremes: knowledge of the number of customers 
assigned to stream 1 tells us nothing about the number assigned to 
stream 2. 

The process of decomposition can be viewed#as a compound Poisson 
process: Let X X ,X 2 , ... be a sequence of independent, identically distributed 
random variables, and let S N = X x H — • + X N , where N is a random 
variable, independent of X V X 2 , — If N has a Poisson distribution, then 
S N (5' = 0) is said to be a compound Poisson process. 

According to Exercise 4, if X X ,X 2 ,... are discrete random variables with 
probability-generating function /(z), then, in light of (5.26), S N has proba- 
bility-generating function g(/(z))= e - ^ 1- ^ 1 . In particular, if X V X 2 ,... 
are Bernoulli random variables with P{X i =l}=p, then S N has probabil- 
ity-generating function that is, S N also has a Poisson distribu- 
tion: 

P{S N =j} = ^e->*'. (5.49) 

(To see the connection with decomposition of a Poisson process, observe 
that if we take /? = \/X, then we can interpret N as the number of 
customers generated by the original stream during an interval of length t, 
and S N as the number of customers assigned to stream / during that 
interval. However, further argument is needed to show that the streams are 
independent.) 
To derive (5.49) directly, observe that 

P{S N =j}= 2 P{S N =j\N=n}P{N=n], (5.50) 

n=j 

where, by assumption, 

P{N- n } = Q£l e -*. (5. 51 ) 

From the assumption that X V X 2 ,... are Bernoulli variables it follows that, 
for fixed n, the sum X x + • • • + X„ has the binomial distribution: 

P{S N -j\N-n}-(j y ( 1 -p) n - J (n >j). (5.52) 

Insertion of (5.51) and (5.52) into (5.50) yields 

a result that the reader can easily verify. 
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We have shown that the Poisson distribution possesses many elegant 
properties. The simplifications resulting from the exploitation of these 
properties permit the solution of many problems that might otherwise be 
intractable. 

Exercises 

21. Suppose customers arrive according to a Poisson process, with rate A. Let M 
be the number of customers that arrive during an interval of length X, where 
X is a nonnegative random variable, independent of the arrival process, with 
distribution function H(t)= P{X^t), and mean t and variance a 2 . 

a. Show that 

p{m-j}- r&£ e -»dH{t) 0=0,1,...), 

•'o J- 
E(M)=Xt, 

E{M 2 )=\t+\\o 2 + t 2 ), 
V(M) = \t+\ 2 o 2 . 

b. Show that 

MM-A-t^ 'i"-^) u =o,.,...). 

c. Show that E(X \M=j) = E(X) if and only if 

P{M -j) 0=0,1,...). 

Note: It can be shown, using the theory of Riemann-Stieltjes integration, 
which will be discussed in Chapter 5, that 

f-M.-^o.M.-- ( ,-o, ,,...) 

A) J- J- 

if and only if H(t) = when t <t and H(t)= 1 when t >t; that is, when the 
random variable X always takes the (constant) value t. Thus, we have 
proved that if arrivals follow a Poisson process, then E(X\M=j)= E(X) if 
and only if X is constant. 

d. Show that, if the service time is not a constant, then for all / such that 
<#(/)< 1, 

P{X<t\M=j+\}<P{X<t\M=j) = 0,1,...); 
and furthermore, this property of stochastic dominance implies that 



E(X\M=j+l)>E(X\M=j) = 0,1,...). 
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e. Suppose H(t) — 1 — e M< . Show, with no calculation, that M has the geomet- 
ric distribution, 

f. Conclude that if X is exponential with mean then 

E{X\M-j)-j±± = 0,1,...). 

Show how this result could have been obtained with no calculation. 

22. Customers request service from a group of s servers according to a Poisson 
process with mean interarrival time X -1 . Service times are mutually indepen- 
dent, exponentially distributed, random variables with common mean At 
time i-Oan observer samples the system and finds all s servers occupied and 
no customers waiting. 

a. Find the probability that the next arriving customer is blocked. 

b. Let N be the number of customers that arrive prior to the first completion 
of a customer in service. Find P{N—j}. 

c. Find the probability that the next arriving customer finds at least two idle 
servers. 

23. Continuation of Exercise 5 (Section 2.4). Conclude that procedures (a) and (b) 
are equivalent if and only if N„ has a Poisson distribution. 

[Hint: Make the substitution x = x+y,y = to obtain Equation (5.20) of the 
text.] 

24. Consider the single-server queue with an unlimited number of waiting posi- 
tions. Assume that the input process is Poisson with rate X. Let the service- 
time distribution function H(£) be arbitrary, with mean t and variance a 2 , and 
let 

H(j)- f V*<tf/(£). 

[tj(j) is called the Laplace- Stieltjes transform of H(£).] Let N* be the number 
of customers remaining in the system just after a service completion epoch, 
with Ilf = P {N* = j) and probability-generating function g(z) = 1,jL nfz J . It 
can (and in Chapter 5 will) be shown that the distribution {II?} satisfies the 
equations 

n?= Pj m+ ^Pj-i+xn? O=o,i,...), (i) 
i-i 

where 

PJ- f°°^e^dH(0 0=0,1,...) (2) 
is the distribution of the number of arrivals during a service time. 
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a. Show that 



where 



b. Show that 



7-0 



c Show that the normalization equation, 

*0)-l. (5) 

implies 

n$=i- P , (6) 

where p = At < 1, so that Equation (4) becomes 



*'>- < 7-T-ff (1 -'' ) ('<'>• 



d. Show that 



(7) 



*<*»)-p* L 2( ;_ p) ;J - (8) 

e. Show that when service times are exponential, then 

jy-(i- P )p> o=o,i,...), (9) 

which is a geometric distribution. 

This model, sometimes called the M/G/l queue, will be discussed in detail in 
Chapter 5. 



Erlangian (Gamma) Distribution 

Again we consider points on a line (or customers arriving in time) and 
assume that the distances between successive points are independently, 
identically, exponentially distributed. That is, we label these points 
T V T 2 ,... and assume that the random variable X- = Tj— T J _ l (J — 1, 2, ... ; 
T =0) has the exponential distribution, P{Xj>t} = e~ Xt = 1.2,...) [and 
therefore the number of points occurring in (0, /) has the Poisson distribu- 
tion with mean Xt]. We wish to find the distribution of the distance 
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spanning n consecutive points, that is, the distribution of the sum S„ — X x 
+ +X n of n mutually independent, identically distributed exponential 
random variables. 

Because S n is the sum of n interarrival times when the arrivals occur 
according to the process {N(t), t>0}, it follows that the events {S n <x) 
and {N(x)>n} are equivalent; therefore, 

P{S n <x}-P{N(x)>n}- f &fe-**, 

jmcft J' 

where the second equality follows from the assumption that {N(t), t>0] is 
a Poisson process. Hence, if we define F(x) = P {S„ <x), then we have 

F(x)=l- V-^r*"^ (*>0). (5.53) 

j=o J- 

The distribution defined by (5.53) is called the Erlangian distribution of 
order n (n = 1,2,...). [Observe that the right-hand side of (5.38) is an 
Erlangian distribution function of order 2.] The component variables 
X X ,X 2 ,--- are sometimes considered to be phases or stages, and conse- 
quently F(x) is often called the w-phase or w-stage Erlangian distribution 
function. 

The Erlangian density function f(x) = (d/ dx)F(x) can also be written 
down directly as was the distribution function (5.53). Interpret f(x)dx as 
the probability that the nth point lies in the differential element about the 
point x. The probability that exactly n— 1 points lie in (0,x) is [(Xx) n ~ l /(« 
-l)!]e -Xjc ; the (conditional) probability that a new point occurs at jc is 
then \dx. Hence the probability that n— 1 points lie in (0,x) and that the 
nth point occurs in (x,x + dx) is f(x)dx = [(\x)"~ l /(n — ly^e'^Xdx; the 
density function is 

/(^ ^"^V ^A (X>0, jc>0). (5.54) 

[Note that this is essentially the same argument that was made in support 
of Equation (5.43).] A graph of the density function (5.54) is given in 
Figure 2.3. Of course, (5.54) follows directly from (5.53) by differentiation. 

From the interpretation of S n as a sum of n mutually independent, 
identical, exponentially distributed phases, each with mean \~ l (and 
variance A -2 ), we immediately obtain the mean E(S n ) = nX ~ 1 and variance 
V(S n )= n\~ 2 . As might be expected, the Erlangian distribution is easier to 
handle than are distributions in general, but not so easy as the exponential. 

As the number of phases increases and the mean phase length decreases 
in such a way that E(S„) remains constant [E(S n ) = c], the variance V(S n ) 



66 



Review of Probability and Stochastic Processes 




Figure 23. Graphs of the Erlangian density function of order n, for n— 1,2,..., 10. 
The densities have been normalized to have mean unity by taking A=n. Adapted 
from L. Kosten, Stochastic Theory of Service Systems, New York: Pergamon Press, 
1973. 

goes to zero: 

lim V(S n )= lim «X _1 \ _1 =0. 

A— >oo \—*co 
n— *oo n—*co 

nX~ , = c n\~ i = c 

Thus, in the limit, the random variable S n becomes a constant, 5' 00 = c. 
Hence, the Erlangian distribution provides a model for a range of input 
processes (or service times) characterized by complete randomness when 
n = 1 and no randomness when n = oo. 

Observe that the Erlangian distribution is the continuous analogue of 
the negative-binomial distribution, just as the exponential is the continuous 
analogue of the geometric. 

The «-phase Erlangian distribution, where n is a positive integer, is 
actually a special case of the gamma distribution, defined by the density 
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function 

/(jc)= ^r Ae_AJC (/7>0, x>0, x>0) - 

The gamma distribution can be viewed as the generalization of the 77-phase 
Erlangian, where the number p of phases is no longer required to be an 
integer. As expected, we have E(S p )-p\~ l and V(S p )=p\~ 2 . Thus the 
gamma distribution provides a potentially useful generalization of the 
Erlangian distribution. In particular, the gamma distribution extends the 
"sum" S p to include the case of a fraction of a phase, since p can be less 
than unity. Hence the gamma distribution provides a model for a range of 
input processes (or service times), characterized by an arbitrarily large 
variance when />-»0 and A->0 but p\~ 1 = c, and zero variance when p-^00 
and A-»oo, again holding p\ ~ 1 = c. The term hyperexponential is sometimes 
used to describe this distribution, but this term is most descriptive if 
applied only when 0<p < 1. (The term hyperexponential has also been used 
by Morse [1958] to describe a distribution function that is a weighted sum 
of exponential distribution functions.) 

The /i-phase Erlangian distribution arises quite naturally in the consider- 
ation of input processes. Suppose, for example, that a device distributes 
incoming requests for service to two groups of servers on an alternating 
basis. Then the input to each group of servers is Erlangian with n = 2 
phases. (But these input processes are not independent.) 

Similarly, the Erglangian distribution (5.53) can be used to describe 
service times that are assumed to be composed of independent phases, 
each with the same exponential distribution. These phases may correspond 
to reality in the sense that each phase represents an identifiable component 
of the service, or they may be hypothesized in order to fit a theoretical 
distribution to data. In the former case, the number n of phases is equal to 
the number of components of the service time, whereas in the latter case, 
the number n of phases is determined from the data. 

For another example of the use of the Erlangian distribution in queueing 
theory, consider again the premise of Exercise 16: At / = an arbitrary 
customer (called the test customer) arrives (places a request for service) and 
finds all s servers busy and j>0 other customers waiting for service. 
Customers are served in order of arrival, and all customers wait as long as 
necessary for service. Service times are assumed to be exponentially 
distributed with mean fi~ l . We wish to find the distribution of the test 
customer's waiting time. To this end, let X x be the elapsed time from t—0 
until the customer at the head of the queue enters service; and let X t 
(*'=2,3,... J + 1) be the length of time that the ith customer spends at the 
head of the queue. (X J+l is the time that the test customer spends at the 
head of the queue.) Clearly, the test customer's waiting time is the sum 
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X x -i + X J+l . Now, the length of time that each customer spends at the 

head of the queue is the elapsed time from his arrival at the head of the 
queue until the completion of the shortest of the s service times then in 
progress. Because the service times are independent, identical, exponential 
variables with mean n~ l , the duration of time from an arbitrary instant 
until the completion of the shortest remaining service time is also exponen- 
tially distributed, with mean (s(i)~ l . Hence, X l ,X 2 ,...,X J + 1 are all identi- 
cal, exponential variables, with mean (jju) -1 ; furthermore, they are inde- 
pendent. We conclude that the test customer's waiting time W, given that 
(1) the number Q of customers that he finds waiting ahead of him in the 
queue is j and (2) W>0 (because he finds all servers busy), has the 
(J+ l)-phase Erlangian distribution: 



Also of importance in applications is the case where the number of 
exponential variables in the sum is itself a random variable; that is, 
S N = X l + • • • +X N> where X X ,X 2 ,... are independent, identical, exponen- 
tial variables, and TV is a nonnegative discrete random variable, indepen- 
dent of X V X 2 , — (S N differs from the compound random variables we 
have considered previously in that X V X 2 ,... are no longer required to be 
discrete. In a later chapter we shall develop appropriate transform tech- 
niques, analogous to the use of generating functions as in Exercise 4, for 
studying this kind of compound random variable. For the present we use 
only direct calculation.) In particular, suppose that N— 1 has the geometric 
distribution, 




(5.55) 



P{N=l + k}=(\-p)p k (& = 0,1,...). 



(5.56) 



Then, applying the theorem of total probability, 



00 



P{S N >t}- 2 P{S N >t\N-j}P{N-j], 



7=1 



we can write 



p{s„>t}- 2 2 



j=l i=0 



or, equivalently, 



;-0i-0 l ' 



{\-p)pj. 



(5.57) 
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If we interchange the order of summation in (5.57) we have 
and this easily reduces to 

P{S N >t} = e-V-rt Xt . ( 5<58 ) 

Thus we conclude that if S N — X l -\ where X U X 2 ,... are inde- 
pendent, identical, exponential variables with mean A -1 , and N is a 
random variable with the geometric distribution (5.56), independent of 
X X ,X 2 ,..., then S N has the exponential distribution with mean [(1 — p)\]~ l . 

For example, consider a queueing system with high-priority customers 
and low-priority customers: High-priority customers will preempt the 
server from low-priority customers, so that no low-priority customer is ever 
served when a high-priority customer is present. We wish to find the total 
length of time a low-priority customer spends in service under two dif- 
ferent priority disciplines, preemptive resume and preemptive repeat. Under 
the discipline of preemptive resume, whenever a preempted customer 
reenters service he simply continues his service where he left off. Under the 
discipline of preemptive repeat, a preempted customer draws a new value 
of service time from the service-time distribution each time he reenters 
service. 

Clearly, when the priority discipline is preemptive resume, the total 
service time of a low-priority customer is not affected by the number of 
times he is preempted, and thus is unaffected by the existence of the 
high-priority customers. 

Now consider the case when the priority discipline is preemptive repeat. 
We assume that service times of low-priority customers are exponential 
with mean /i -1 , and also that the high-priority customers arrive according 
to a Poisson process with rate a. We shall show that the total time spent in 
service by a low-priority customer is (again) exponentially distributed with 
mean p ~ 1 , independent of the stream of high-priority customers. Thus, in 
this case, the total time spent in service by a low-priority customer has the 
same distribution under both priority disciplines, and is not affected by the 
existence of the high-priority stream. We now show that this surprising 
result can be obtained by application of the argument leading to (5.58). 

Let Y- be the elapsed time between the start of service of a low-priority 
customer after his (J— l) tn preemption and the next arrival of a high-prior- 
ity customer, and let Z, be the duration of the new service time drawn by 
the low-priority customer when he begins service after his (J—\)th pre- 
emption. If we let Xj~min(Yj,Zj), then the total service time of the 
low-priority customer is S N = X x + • • • + X N , where the random variable 
N— 1 is the total number of times he is preempted from the server by the 
arrival of a high-priority customer. 
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It follows from Equation (5.23) that P {Yj<Zj} = a/(a + ju); hence the 
distribution of N is given by (5.56), with p = a/(a + n). Now, observe that 
P {Xj>t} = P {min( Y p Zj)>t\ Y y <Zj] for / = 1, 2, . . . N- 1 ; and P {X N >t) = 
P{min(Y N ,Z N )>t\ Y N >Z N }. It follows from Exercise 15 that when Yj and 
Zj are independent, exponential random variables, the conditions Yj<Zj 
(j=\,2,...,N- 1) and Y N >Z N are irrelevant; thus X l ,X 2 ,... are indepen- 
dent identical exponential variables with mean E(Xj) = (a + n)~ l . Exercise 
15 further implies that N is independent of the random variables X v X 2 , . . . 
(because no inference can be drawn about the value of Xj from knowledge 
of whether j<N or j = N). 

Therefore, Equation (5.58) applies, with p = a/(a + n) and X = a + fi; 
hence, P{S N >t} = e~ lU , and our assertion is proved. 

This result could have been anticipated on intuitive grounds: Since the 
service times are exponential, they have the Markov property, which 
implies that it makes no difference whether a low-priority customer who 
reenters service continues with his old service time (preemptive resume) or 
selects a new one (preemptive repeat). Note that this intuitive argument 
also shows that the assumption of a Poisson arrival process for the 
high-priority customers is superfluous; regardless of the probabilistic struc- 
ture of the process of preemption, the distribution function of the total 
service time for the low-priority customers remains the same. 

Exercises 

25. An operations research consultant is retained to help a highway engineering 
department study the relationship between the flow of vehicles on an express- 
way and the delays suffered by other vehicles that want to merge onto the 
expressway from an entrance ramp. The consultant proposes the following 
model. Let C/j be the time from the instant the merging vehicle moves to the 
head of the entrance ramp until the first car in the expressway stream passes 
the entrance point, and let U k (A: = 2, 3, ...) be the times between the (k — l)th 
and Arth vehicles in the expressway stream. The merging vehicle will start to 
enter the expressway immediately if its required acceleration time V x , say, is 
less than £/ t ; in that case his merging time is, by definition, V v If V x > [/, the 
merging vehicle waits until the first expressway vehicle passes, and then 
begins to enter the expressway if its new required acceleration time V 2 is less 
than U 2 ; in this case the merging time is + V 2 , Thus, the merging time of 

the vehicle on the ramp is f/, H + £/„_ , + V„ if V k > U k for all k - 1,2,. . ., 

n - 1 and V„ < U„. 

Further, asserts the consultant, we can assume that U lt U 2 ,... are indepen- 
dent, identical, exponential variables with mean a -1 , say, and V U V 2 ,... are 
independent, identical, exponential variables with mean /J -1 , say. Of course, 
the consultant admits, the accuracy of numerical values for the merging times 
that will be derived from this model will depend on the accuracy of the 
underlying assumptions; nevertheless, the model should give a good qualita- 
tive picture of the functional dependence of merging times on the parameters 
a and /3. 
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A critic of the model argues that such qualitative information is not helpful, 
since a qualitative description is intuitively obvious to even the most casual 
observer: Merging times increase as a increases relative to j8 _1 . The model 
will not give useful qualitative information because the assumption that the 
vehicles on the expressway can be described by a Poisson process is unrealis- 
tic. Clearly, asserts the critic, the Poisson assumption is valid only when the 
expressway traffic is light; otherwise, a distribution that implies less variability 
in the interarrival times U U U 2 >..- is required. Therefore, concludes the critic, 
the model will be useful only when a more accurate description of the 
expressway traffic is provided. 

Evaluate the arguments of the consultant and the critic. 

26. In the model of Exercise 25, let X be the merging time of a vehicle; that is, X 
is the time from the instant the vehicle reaches the head of the ramp until it is 
merged into the mainstream of expressway traffic. Then, as a consequence of 
the assumptions of the model of Exercise 25, X has the same distribution as 
does the acceleration time V. Suppose now that each vehicle has a characteris- 
tic acceleration time V; that is, the merging time is £/jH + U n -\ + V if 

V>U k tor dL\\k=\,2,...,n-\ and V<U n . 
a. Show that 



b. Assume that V is exponentially distributed with mean /? ~ \ and show that 
E(X) = (p-a)~ 1 when#>a and E(X)=cc when /3 <a. 

c. Show, for the assumption of part b, that the variance V(X ) of the merging 



when 0>2a, and V(X)= oo when j9 < 2a. 

Feller [1971] uses this model of "gaps" as an illustration in his exposition of 
renewal theory. For further references and a discussion in the context of 
vehicular traffic theory, see Ashton [1966] and Haight [1963], It is interesting 
to observe that if one were asked to make an a priori choice between the 
hypotheses of this model and those of Exercise 25, one could make a good 
argument in favor of either; nevertheless, the conclusions that follow from 
these two sets of apparently similar hypotheses are radically different. 



2.6. Remarks 

We have reviewed some topics from the theory of probability and 
stochastic processes that are particularly important in queueing theory. 
There are many excellent books in which these and related topics are 
covered. We mention a few for reference. 



E(X\V=c) = c + (e ac -\)E(U\U<c). 



times is 
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Feller [1968] is perhaps the most widely referenced book on probability. 
It contains valuable material on discrete probability distributions, generat- 
ing functions, and birth-and-death processes. The first chapter of a second 
volume, Feller [1971], treats the uniform and negative-exponential distribu- 
tions. These two books comprise a most thorough and lucid treatment of 
probability theory and its applications. 

Fisz [1963] provides a firm background in probability, covering many 
topics of interest to applied workers. A recent textbook in probability that 
is especially recommended is Neuts [1973], which includes a good discus- 
sion of probability-generating functions and the method of collective 
marks. 

Some recent textbooks that treat the theory of stochastic processes with 
an eye toward applications in queueing theory and related areas are Bhat 
[1972], ginlar [1975], Clarke and Disney [1970], Cox and Miller [1965], 
Karlin [1968], and Ross [1970]. Another book by this last author, Ross 
[1972], gives a good elementary treatment of probability and stochastic 
processes. 

Takacs [1960a] provides short summaries of some important topics in 
stochastic processes, followed by problems with complete solutions. 

Finally, Cohen [1969] and Syski [1960] are two important reference 
books in queueing theory that contain substantial preliminary material on 
probability and stochastic processes. 

There are many other books on queueing theory that do not include 
substantial preliminary material on probability and stochastic processes. 
These books will be referenced in the present text when appropriate and 
will be discussed briefly in an annotated bibliography in the final chapter. 
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3.1. Introduction 

In Chapter 1 we discussed the characteristics of a queueing model, and in 
Chapter 2 we discussed the birth-and-death process. In this chapter we 
shall develop a correspondence between certain queueing models and 
certain birth-and-death processes, and then specialize the results of our 
previous birth-and-death analysis to provide an analysis of these queueing 
models. 

Recall from Chapter 1 that a queueing model can be defined (roughly 
speaking) in terms of three characteristics: the input process, the service 
mechanism, and the queue discipline. The input process describes the 
sequence of requests for service. For example, a common assumption for 
the input process, and one that we shall make in this chapter, is that of 
Poisson (or random) input, where the customers are assumed to arrive 
according to a Poisson process. Another input process we shall study in 
this chapter is called quasirandom input, where each idle source generates 
requests independently and with the same exponentially distributed inter- 
request time. 

The service mechanism includes such characteristics as the number of 
servers and the lengths of time that customers hold the servers. In this 
chapter we shall study primarily models with an arbitrary number of 
parallel servers and with independent, identically distributed exponential 
service times. We shall also consider some variants, such as one in which 
+ he servers are selected in a prespecified order. 

The queue discipline specifies the disposition of blocked customers 
(customers who find all servers busy). In this chapter we shall consider two 
different queue disciplines. When blocked customers do not wait, but 



74 



Birth-and-Death Queueing Models 



return immediately to their prerequest state, the queue discipline is said to 
be blocked customers cleared (BCC). And when blocked customers wait as 
long as necessary for service, the queue discipline is said to be blocked 
customers delayed (BCD). In this latter case, it is sometimes necessary to 
specify the order in which waiting customers are selected from the queue 
for service. We shall concern ourselves primarily with service in order of 
arrival, although we will also outline some results for service in random 
order. 

Our strategy is to exploit the fact that certain queueing processes can be 
modeled as birth-and-death processes. Thus, if we let {N(t), t>0) be the 
number of customers present at time /, then the statistical-equilibrium state 
probabilities, P J = \im. t ^ OQ P{N(t)=j}, are given by Equation (3.10) of 
Chapter 2, which we repeat for convenience: 

1 



0=o), 

i+ 2 (\>V**^fc-i//*i/*2* --/**) 

*-i (l.i) 

— = 1,2,...). 

As we showed in Chapter 2, this equation is equivalent to the "rate 
up = rate down" equations 

\jPj = lx J+l P j+l = 0,1,2,...), (1.2) 

augmented by the normalization equation 

f/> y =l. (1.3) 

The birth rates A, (J = 0, 1,2, ...) and death rates fi. (j — 1,2,...) must be 
chosen to reflect the assumptions about the input process, the service 
mechanism, and the queue discipline. 

For example, consider an infinite-server system, and suppose that 
arrivals occur according to a Poisson process, with rate A, and service times 
are exponentially distributed, with mean ju, -1 , independently of the arrival 
process and each other. (That is, we have Poisson input and exponential 
service times.) Suppose we observe the system at some time t, and suppose 
N(t)=j. Interarrival times are exponential with mean A -1 , and service 
times are exponential with mean ju -1 ; thus the time from / until the next 
event, whether an arrival or a service completion, is also exponentially 
distributed, with mean (\+jn)~K (This follows because the time to the 
next event is the minimum of the exponential interarrival time and the j 
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exponential service times.) Thus, the probability that an event will occur in 
{t, t + h) is (A + j'n)h + o(h) as /t— »0. When an event does occur, the proba- 
bility that it will be caused by an arrival rather than a service completion is 
A/(A+7/i). [According to Exercise 15 of Section 2.5, the length of time 
required for the event to occur and the type of the event (whether arrival 
or service completion) are independent.] Therefore, the probability of 
occurrence of the transition Ej^E J+l in (t,t + h) is 

\(X+jti)h + o(h) 1 t ~~ ; + o{h)= Xh + o(h) (h^O), 
A +jfx 

where the second o(h) on the left-hand side of this equation is the 
probability of multiple arrivals and service completions that could effect 
this transition. Thus, 

P{N(t + h)=j+l\N(t)=j}=\h + o(h) (h^O). (1.4) 

By a similar argument, 

P {N(t + h) =j~\\ N{t) = /} =jiLh + o(h) (h-+0). (1.5) 

The probability that no event occurs in (t,t + h) is e~ {X+J ^ h \ and because 
the probability that more than one event will occur in (t,t + h) is o(h) as 
h-*0, it follows that 

P { N(t + h) =j\ N{t) =j } = e ~ + o(h) 

= l-(\+jn)h + o(h) (h^O). (1.6) 

Again, because the probability of more than one event occurring in 
(t,t + h) is o(h) as h->0, it follows that 

P{N(t + h) = k\N(t)=j} = o(h) (h-*0, \k-j\>2). (1.7) 

Thus, we have shown that in the infinite-server system having Poisson 
arrivals with rate A and exponential service times with mean ju, -1 , the 
number N(t) of customers present at time / obeys the postulates of the 
birth-and-death process with birth rates A y = A 0=0,1,2,...) and death 
rates t^=j[i (j= 1,2,...). [This model has been considered previously — 
see Equation (4.20) of Chapter 2.] Thus, Equation (1.1) applies, and we 
have 

Pj= (^M_ e -x/, = 0,1,2,...). (1.8) 

We return our attention to the general ^-server case. One of the quanti- 
ties of interest that can be calculated from the equilibrium probabilities 
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(1.1) is the carried load a', which is defined as the mean number of busy 
servers at an arbitrary instant in equilibrium: 

The first sum on the right-hand side of (1.9) reflects the assumption that if 
the number of customers in the system is less than the number of servers, 
then all these customers will be in service; the second sum reflects the fact 
that all s servers will be busy if there are at least s customers in the system. 

In view of the definition of carried load as the mean number of busy 
servers, it can be shown (using Little's theorem, which is discussed and 
proved in Section 5.2) that the carried load equals the product of the mean 
service completion rate and the mean service time. 

The server occupancy p is defined as the load carried per server in 
equilibrium: 

P=-. (MO) 

s 

The server occupancy p measures the degree of utilization of a group of 
servers, and is therefore sometimes called the utilization factor. Clearly, we 
must always have p < 1. 

Another important quantity is the offered load a, which is defined as the 
product of the arrival rate and the mean service time. Thus, if A is the 
arrival rate and t is the mean service time, then 

a = Ar. (1.11) 

The offered load is a dimensionless quantity whose numerical values, 
which provide a measure of the demand placed on the system, are 
expressed in units called erlangs (abbreviated erl). 

In the case of Poisson input, the offered load a is equal to the mean 
number of arrivals per service time. That is, let X be an arbitrary service 
time, with distribution function P{X<t) = H{t) and mean value t = 
fgtdH(t); and let M be the number of arrivals during the service time X. 
If we let pj = P { M =y }, then 

Pj~ r^-e-^dH(t) = 0,1,2,...). (1.12) 
•'o J- 

Now E(M) = 2,JLJp J = fo\tdH(0, where the last equality follows after 
using (1.12) and interchanging the order of summation and integration. 
Comparison with (1.11) shows that E(M) = a for Poisson input, as 
asserted. 
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For example, consider again the infinite-server model with Poisson 
arrivals, with rate A, and exponential service times, with mean To 
calculate the carried load, substitute (1.8) into (1.9). Note that in this case 
(s = ao), the definition (1.9) reduces to the mean value of the (Poisson) 
distribution (1.8); hence, a' = \/n. To calculate the offered load, use the 
fact that when H(f) — 1 — e - '", the mean service time t is given by t = fi~ '; 
then (1.11) yields a = \/ii. Thus we see that in this example a' = a. This 
equality of carried and offered loads reflects the fact that in this model, all 
arrivals are served (because s = oo), and therefore the mean number of 
service completions per unit time equals the mean number of arrivals per 
unit time. 

Before proceeding to the application of Equation (1.1) to queues with 
particular choices of the input process, service mechanism, and queue 
discipline, we direct our attention to the relationship between the probabil- 
ities given by Equation (1.1), which describe the system at an arbitrary 
instant during equilibrium, and the probabilities that describe the system at 
the instants at which the customers arrive. 

3.2. Relationship between the Outside Observer's Distribution and 
the Arriving Customer's Distribution 

In Section 2.5 we showed [see Equation (5.39)] that for systems with 
Poisson input, the arriving customer's distribution {TIj(t)} and the outside 
observer's distribution {P/j)} are equal: 

n j (t) = P j (t) (/>0; y = 0,l,...). (2.1) 

That is, when customers arrive in a Poisson stream, the probability that 
there are j customers present in the system at any time t is the same 
whether or not t is an arrival point. We also showed by counterexample 
that (2,1) is not true in general. 

We now give an alternative argument in support of (2.1) for systems 
with Poisson input. Let C(t,t + h) be the event {a customer arrives in the 
interval (t,t + h)}, and define 

IL(t)= lim P{N(t)=j\C(t,t + h)}, (2.2) 

A-»0 

where, as before, N(t) is the number of customers present at time /, and we 
use the notation Pj(t) = P{N(t)—j). [Note that the occurrence of the event 
{C(t,t + h)} does not imply that the arriving customer necessarily enters 
the system; it includes the case where the arrival departs immediately, 
without causing a state transition.] 

Consider the probability P{C(t,t + h)\N(t)=j). By assumption, the 
customers arrive according to a Poisson process; hence, the probability 
that an arrival occurs in any interval of length h is Xh + o(h) as h-+0, 
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regardless of the value of t, j, or anything else. Thus, for Poisson input, 

P{C(t,t + h)\N(t)=j} = P{C(t,t + h)}; 

that is, the events {C(t,t + h)} and {N(t)=j} = 0, 1,...) are independent. 
It follows from the independence of these events that 

P{N(t)=j\C(t,t + h)} = P{N(t)=j). 

Thus, for Poisson input, Equation (2.2) reduces to (2.1), as was to be 
shown. 

For the purpose of making calculations for systems with non-Poisson 
input, it is often useful to express (2.2) in the form commonly referred to as 
Bayes' rule: 

i}(0 = hm — . (2.3) 

2 P{C{tJ + h)\N{t) = k}P k {t) 

k = 

Of interest in most applications is the equilibrium arriving customer's 
distribution {IIj}, which we define as 

JZ«limJZ(/) = 0,1,...). (2.4) 

J r-»oo J 

In particular, Equation (2.1) yields (for systems with Poisson input) 

IIj-Pj = 0,1,...). (2.5) 

An interpretation of (2.5) [and (2.1)] is: Since Poisson arrivals occur 
randomly in time, the proportion TIj of arrivals that occur when the system 
is in state E. equals the proportion Pj of time that the system spends in 
state Ej. 

Note that we have nowhere assumed that the process {N(t), t>0} is 
necessarily a birth-and-death process. If it is, and if P{C(t, t + h)\N(t)=j} 
=Xjh + o(h) as h^>0, then Equations (2.3) and (2.4) yield 

q= $ Pj = 0,1,...), (2.6) 

where Pj = 04, •) is given by (1.1). 



Exercises 



1. a. A single-server queueing system with an unlimited number of waiting 
positions is modeled as a birth-and-death process with \ J = {j+\)~ i \ for 
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j>0 and fij = ti for j>\, where births correspond to arrivals and deaths to 
service completions. Show that 

p j =Q jr e ~ Xr o--<u,..) 

and 

nj={\-e- x *y'p j+x 0=0,1,...), 

where t = /a _1 . Calculate the carried load a'. Convince yourself that the 
arrival rate equals SJLo ^j^p an< * calculate the offered load a. Verify that 
a = a'. 

b. A single server with exponential service times with mean n~ l serves 
customers who arrive in a Poisson stream at rate A. An arriving customer 
who finds j other customers in the system will, with probability y'/O+l)* 
depart (renege) immediately; all customers who do not depart immediately 
wait as long as necessary for service. Find {Pj} and {IIj). Find the carried 
load a' and the offered load a, and verify that the probability that an 
arbitrary arrival does not receive service equals the ratio (a — a')/ a. 

c Customers arrive in a Poisson stream at a single server with an infinite 
number of waiting positions. Observation indicates that the server works 
faster as the queue size increases. If this system is described by a birth-and- 
death model with A^ =A and iL f =jfi = 0, 1,...), find {Pj} and {IIj}. Calcu- 
late the carried load a'. By considering the queue discipline in which (i) each 
arrival enters service immediately, preempting the customer in service (if 
any) and moving him back to the head of the queue, and (ii) the customers 
in the queue are served in reverse order of arrival, convince yourself that the 
mean service time equals SJLo^ty+V Calculate the offered load a, and 
verify that a — a'. 

For a list of references and discussion of the models of this exercise from a 
different point of view, see Conolly and Chan [1977], and also Brumelle [1978]. 

2. Customers arrive at a two-chair shoeshine stand at rate 10 per hour. The 
average length of a shoeshine is 6 minutes. There is only one attendant, so that 
one chair is used as a waiting position. Customers who find both chairs 
occupied go away. 

a. Assuming Poisson input and exponential service times, write and solve the 
statistical-equilibrium probability state equations. 

b. Find the mean number of customers served per hour. 

c. Repeat the analysis to calculate the mean number of customers served per 
hour when there are two attendants at the stand (and no waiting positions). 

3.3. Poisson Input, s Servers, Blocked Customers Cleared: 
The Erlang Loss System 

In this model, we assume that customer arrivals follow a Poisson process 
with rate X; that service times are exponentially distributed with mean 
service time independent of each other and the arrival process; and 
that customers who find all s servers busy leave the system and have no 
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effect upon it, that is, blocked customers cleared (BCC). (However, the 
assumption of exponential service times will be seen to be unnecessary.) 
An application of the BCC assumption arises in telephone traffic engineer- 
ing, where calls that find all trunks busy are given a busy signal. Other 
applications should be apparent. 

Since customers arrive at random with rate \, but effect state changes 
only when j<s (because blocked customers are cleared), we write 

| A when y = 0,l,...,j-l, , . 

Mo when y-,. (3 ' 

Because service times are exponential, when there are j customers in 
service the rate ^ at which service completions occur is 

l*?=Jp = 1,2,...,.?). (3.2) 

The rates (3.1) and (3.2), in conjunction with (1.1), give for the statisti- 
cal-equilibrium probability of j busy servers 

Pj- 0-0,1,...,') (3.3) 

^ (Vm)* 

and Pj — for j>s. The distribution (3.3) is called the truncated Poisson 
distribution. Note that the rates \ and /* appear in (3.3) only through the 
ratio \/n, which, according to Equation (1.11), is the offered load a. We 
call a system with Poisson arrivals and blocked customers cleared [whose 
equilibrium-state probabilities are given by (3.3) for any service- time 
distribution] an Erlang loss system. (A. K. Erlang, 1878-1929, was a 
Danish mathematician who laid the foundations of modern teletraffic and 
queueing theory — see Brockmeyer et al. [1948].) 

The probability that all s servers are busy is given by (3.3) with j = s. 
Formula (3.3) with j — s is called the Erlang loss formula or Erlang B 
formula in the United States and is denoted by 

a s /s* 

B(s,a) (3.4) 

2 a k /k\ 

k = 

Likewise, the truncated Poisson distribution (3.3) is also known as the 
Erlang loss distribution. In Europe, the right-hand side of (3.4) is called 
Erlang' s first formula and is denoted by E lfS (a). The formula (3.4) was first 
published by Erlang in 1917. As already suggested, the Erlang loss formula 
has found extensive application in the field of telephone traffic engineer- 
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ing. Curves of (3.4) for fixed values of s are plotted against increasing 
values of a in Figures A.l and A.2 of the appendix. Equation (3.4) is also 
tabulated in Dietrich et al. [1966]. Further mathematical and numerical 
properties of the Erlang loss formula are discussed in Jagerman [1974]. 

We emphasize that (3.4) gives both the proportion of time that all s 
servers are busy and the proportion of arriving customers* who find all j 
servers busy (and thus are lost). Note also that (3.4), which we have 
derived from first principles, is the same as (1.6) of Chapter 1, which was 
derived through a heuristic argument. We have now answered some, but 
not all, of the questions raised in Chapter 1 about the validity of this 
formula. 

For the Erlang loss system, the carried load a', calculated directly from 
the definition (1.9) and the probabilities (3.3), is easily shown (see Exercise 
3) to be 

a' = a[\-B{s,a)\ (3.5) 

This result can be interpreted to say that the carried load a' equals that 
portion of the offered load a that is not lost. (In particular, when s=oo, 
then a' = a; that is, the offered load is the load that would be carried if 
there were no blocking.) Similarly, we can interpret aB(s,a) as the lost 
load. Then it follows (in this case) that the proportion of arrivals that is lost 
equals the ratio of the lost load to the offered load. 

Exercise 

3. Derive (3.5) from the definition (1.9) and the probabilities (3.3). 

Investigation of (3.4) and (3.5) leads to the following important conclu- 
sion: As the number of servers is increased and the offered load is 
increased in such a way that the probability of blocking remains constant, 
the server occupancy [defined by Equation (1.10) as the carried load per 
server] increases. In other words, large server groups are more efficient 
than small ones. 

Unfortunately, in practice hardware limitations sometimes preclude ex- 
ploitation of this fact. Moreover, high-occupancy server groups are more 
vulnerable to service degradation during periods of overload than are 
smaller server groups with the same blocking probability but lower oc- 
cupancy. 

For a server group of fixed size, occupancy increases with increasing 
load, thereby increasing server-group efficiency; unfortunately, the proba- 
bility of blocking also increases with increasing load. Hence, efficient use 
of equipment must be balanced against the provision of acceptable service 
for the customers. 
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Exercises 

4. Consider an Erlang loss system with 10 servers. Measurements show that about 
1% of the arriving customers are lost (denied service). It is estimated that the 
rate of requests for service will double over the next year. How many servers 
must be added so that the system will provide the same grade of service (that 
is, the same probability that an arrival will find an idle server) to the customers 
in the face of the increased demand? 

5. An entrepreneur offers services that can be modeled as an j-server Erlang loss 
system. Suppose the arrival rate is 4 customers per hour; the average service 
time is 1 hour; the entrepreneur earns $2.50 for each customer served; and the 
entrepreneur's operating cost c is $1.00 per server per hour (whether the server 
is busy or idle). What is the optimal number of servers, and what is the hourly 
profit earned when the optimal number of servers is provided? What is the 
maximum value of c beyond which it is unprofitable for the entrepreneur to 
remain in business? 

6. Show that 

aB(s — l,a) 



B(s,a) = 



s + aB(s — l,a) 
How might this recurrence be computationally useful? 

7. Consider an Erlang loss system with retrials. Any customer who finds all s 
servers busy will return and place another request for service at some later 
time, and will persist in this manner until he receives service. Let a=\r, where 
t is the average service time and \ is the rate at which the original requests for 
service occur. (That is, X is the arrival rate with retrials not counted.) Since all 
customers are served, the carried load equals the original offered load a=\r. 
Therefore, it can be argued, the probability of blocking (the probability that a 
customer requesting service will find all servers busy and be forced to retry) is 
given by B(s, a), where a satisfies the equation 

a[\- B(s,a)] = a. 

Discuss the strengths and weaknesses of this approximate analysis of the 
Erlang loss system with retrials. (See Riordan [1962].) 

8. Consider an equilibrium j-server Erlang loss system with exponential service 
times and offered load a erl. A statistician observes the system at a random 
instant and waits until the next customer arrives. Show that the probability p 
that this customer is blocked is 



Explain why p^B(s,a). 
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9. The Erlang loss system as a semi- Markov process. A (stationary) Markov chain 
is a stochastic process {X„, n=0, 1,2,...} whose values are countable, and for 
which the transition probabilities satisfy 

P{X H +i —j\X n = iiXn-\~ *n- i> • • • > X] = i' l9 X = i Q ] 

= P{Xn + X=j\X n = i}- Pi j\ (1) 

that is, for each n, the conditional probability that the process will have value j 
(be in state EJ) after the (n+ l)th step depends only on the state of the process 
after the nth step, and is independent of the past history of the process prior to 
the nth step. If we define 

f - J™r{x*-Mo-i}, (2) 

then it is known that, under certain conditions, the limiting distribution Pf 
(J — Q, 1,2,...) exists and is the unique solution of the equations 

Pr=f> p TPu = 0,1,2,...) (3) 

i-0 

and 

2*7-1. (4) 

J-o 

A semi- Markov process is a Markov chain in which the times between transi- 
tions are random variables. It can be shown that (a) if the Markov chain {X„, 
«=0, 1,2,...} takes only the values 0,1,..., and (b) if, for each n, m ( is the 
mean length of continuous time that X n = i before making a transition, then the 
proportion Pj of time that the process spends in state Ej is the weighted average 
of the probabilities {P*}: 

Pj 0=0,1,...,*). (5) 

i-0 

(These important topics are discussed in most books on probability and 
stochastic processes, such as those referenced in Section 2.6.) 

Now consider the equilibrium, ^-server Erlang loss system with exponential 
service times. Then the distribution {Pj} above corresponds to the equilibrium 
birth-and-death probabilities. Assume that the conditions required for the 
above results are satisfied, and 

a. find p y — 0, 1 , . . . , s); 

b. find w, (/ = 0, l,...,s); 

c. show that Equation (5) agrees with (3.3). 

An important theorem is that all of the results derived above are true for 
any service-time distribution function with finite mean, even though the 
Markov property of the exponential distribution function was used ex- 
plicitly in the above derivation. [Specifically, in using the birth-and-death 
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fonnulation we have assumed that if at any instant t there are j customers 
in service, each with mean service time ju _1 , then the probability that a 
customer will complete service in (/, t + ti) is jph + o(h) as h— >0, indepen- 
dent of the value of t and the lengths of time that the customers have been 
in service.] 

The remarkable fact that the truncated Poisson distribution (3.3) is valid 
for an arbitrary service-time distribution function was conjectured by 
Erlang himself in 1917, and proved by Sevast'yanov [1957]. Subsequently, 
many investigators studied this problem, and a variety of proofs have been 
offered. (See pp. 271-278 of Syski [I960]; see also Takacs [1969], Cohen 
[1976], Oakes [1976], and Brumelle [1978].) It is worthwhile at this point to 
give proofs for the special cases s= 1 and s= oo. A proof for the case of 
arbitrary s will be outlined in Chapter 4. Before proceeding with the 
proofs, however, let us consider the following example, which illustrates 
the utility of the theorem. 

First, consider a server group that serves two independent streams of 
Poisson traffic (for example, a telephone trunk group serving eastbound 
and westbound traffic) on a BCC basis. That is, suppose that arrivals from 
stream / occur according to a Poisson process with rate \, and assume first 
that the customers in each stream have a common (but arbitrary) service- 
time distribution function with mean t. Then the two streams can be 
viewed as composing a single Poisson stream with rate A, + A 2 , so that the 
outside observer's distribution {Pj} is given by (3.3) with \ = \ l +\ 2 . Since 
each stream is Poisson, each stream sees the (same) outside observer's 
distribution, and hence each stream sees the same arriving customer's 
distribution; thus the probability of blocking is the same for each stream. 

Suppose now that the (arbitrary) service-time distribution functions of 
the two Poisson streams are not identical, with stream i having mean 
service time t ( . Arrivals still occur according to a Poisson process with rate 
Xj+Aj. It follows from our theorem that, regardless of the form of the 
composite service-time distribution function, the state probabilities are still 
given by (3.3), where the composite mean service time t remains to be 
determined. Again, since each stream is Poisson, each stream sees the same 
state distribution. Thus, the proportion of served customers who are from 
stream / is the same as the proportion of arriving customers who are from 
stream /, namely A, /A. Therefore, the (composite) mean service time t for 
all customers who are served is 




Multiplying through by A, we obtain 

a = <2j + a 2 , 

where a = At and a^A,^. 
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We conclude that when two streams of Poisson traffic of magnitudes Oj 
and a 2 erlangs, respectively, are offered to a single group of s servers on a 
BCC basis, each stream suffers a blocking probability given by the Erlang 
loss formula (3.4) with a = a l + a 2 . The generalization of this result to 
include an arbitrary number of independent Poisson streams is clearly 
true. 



Exercise 

10. Two independent Poisson streams of traffic, called low-priority and high- 
priority traffic, are handled by a primary group of 10 servers. Low-priority 
customers who are blocked on (overflow from) the primary group are cleared 
from the system, whereas high-priority customers who overflow the primary 
group are routed to a backup of overflow servers. According to measure- 
ments, the high-priority customers require an average of 12 minutes for 
service and arrive at rate 20 per hour, of which about 2 per hour overflow. 

a. If the arrival rate of low-priority customers doubles, by what factor will the 
overflow rate of the high-priority customers increase? 

b. If the overflow group handles the overflow traffic on a BCC basis, is it 
legitimate to use the Erlang loss formula to decide how many servers to 
provide in the overflow group? 

We now give a heuristic argument for the case s = 1 of the theorem: For 
a system with Poisson input and blocked customers cleared, the distribu- 
tion {Pj} of the number of customers present at an arbitrary point in 
equilibrium is the Erlang loss distribution (3.3). 

We suppose that customers arrive according to a Poisson process with 
rate A. An arrival who finds the server idle will hold it for a time interval 
whose duration is a random variable with mean t. An arrival who finds the 
server occupied will leave the system immediately. Hence the server 
alternates between busy and idle states, each busy period of mean length t 
and each idle period of mean length A~ l . This is illustrated in Figure 3.1. 

Consider a single cycle composed of an idle period and an adjacent busy 
period. The cycle has mean length A ~ 1 + t, and thus the ratio of the mean 
busy period to the mean cycle length is 

(3.6) 



where the right-hand side follows from (1.11). This is the same as P v given 



Figure 3-1. 



Busy 
Idle 
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by (3.3) when s=\. Since the ratio of mean values t/(A _1 + t) is indeed 
the same as the proportion of time that the server is busy, we conclude that 
P l = a/(l + a) is the probability that the server is occupied, for any 
service-time distribution function with mean t, as asserted. Since we 
require that P Q + P x = l, it follows that P =l/(l + a), again in agreement 
with (3.3). Hence, the theorem is proved for s= 1. 

Let us consider this single-server model again, this time from the point 
of view of arriving customers. We have argued that the proportion of time 
(or probability) the server is busy is a/(l + a), that is, P = 1/(1 + a) and 
P x = a/(\ + a), and we know that for systems with Poisson input {IIj} = 
{Pj}- We now calculate JT and iTj directly by application of Equation 
(1.4) of Chapter 2: 

Ul l + E(N)* K } 

where N is the number of customers who arrive during an arbitrary busy 
period. When blocked customers are cleared, the busy period is the same 
as the service time; hence E(N) = a. Thus, we conclude from comparison 
of (3.6) and (3.7) that, as promised, TI X = P x and, since JT + n x = l = P + 
P v therefore II = P Q . 

We now turn to the case s = oo. We shall calculate the (transient) 
distribution {Pj(t)} of the number of customers present at time and show 
that for any service-time distribution function H(x) with finite mean 
H- l = f?xdH(x), 

lim P( t )=H^e' x ^ (y = 0,l,2,...); (3.8) 

that is, Mm^^ P j (t) = P p where Pj is given by (3.3) with s = oo. 

To begin, consider an arbitrary customer (the test customer) who is 
assumed to have arrived at some time t during the interval (0,/). We shall 
calculate the probability p(t) that the test customer is still present at time t. 

Let X be the service time of a customer, with distribution function H(x). 
The test customer will be present at time / if either (a) X >t or (b) X<t 
and t + X>t. Event (a) has probability \- H(t). To calculate the proba- 
bility of event (b), note that if X= x <t, then event (b) occurs if and only if 
t lies in the interval (/ — x, t). (See Figure 3.2.) Since by assumption the 
arrivals occur according to a Poisson process, it follows [see the argument 
surrounding Equation (5.29) of Chapter 2] that the location of the arrival 
epoch t is uniformly distributed in (0, /). Therefore the probability that i 
lies in the interval (t — x,t) is x/t. Hence, event (b) has probability 
f' (x/t)dH(x), and p(t) is given by 

p{t) = \-H{t)+ C?-dH{ x ). (3.9) 
t 
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Figure 3-2. 

Equation (3.9) gives the probability p(t) that an arbitrary customer, who 
is assumed to have arrived in (0,/), is still in service at time t. Thus, if n 
customers arrive in (0,/) according to a Poisson process, the probability 
that exactly j of them will still be present at time t is the binomial 

(j)p J (t)[l- P (t)y- J ; 

it now follows from the theorem of total probability that 

fy')- %{jy(»[l-p(»] H - jS £e-*. (3-10) 

Reference to Equation (5.52) of Chapter 2 shows that 

\\tp(t)] J ^ ^ 
p j(')= e ^ 0-0,1,...). (3.11) 

Hence, for arbitrary service- time distribution function, the number of 
customers in service at time />0 has the Poisson distribution with time- 
dependent mean Xtp(t), [The reader should compare (3.11) with Equation 
(4.26) of Chapter 2.] For another derivation of (3.11), see Exercise 45. 

We now consider the statistical-equilibrium distribution (3.8). We leave 
it to Exercise 1 1 for the reader to show that for any distribution function 
H(x) (0 < x < oo ) with finite mean ju, ~ 1 = JqX dH(x), 

lim t\ 1-7/(01=0. (3.12) 

It follows from (3.9) and (3. 12) that lim,^ tp(t) = n~\ Hence, taking limits 
through (3.11) yields (3.8), as asserted. 

Exercises 

11. Prove Equation (3.12). 

12. Blocked customers held. The following model is widely used in teletraffic 
engineering: Customers arrive according to a Poisson process at a group of s 
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servers. An arriving customer is willing to spend an amount of time T (called 
the sojourn time) in the system, where T is a random variable, after which he 
will depart regardless of whether or not at the expiration of his sojourn time 
he is in service or is still waiting in the queue. In other words, a blocked 
customer will wait for service as long as time T; if he receives service before 
the expiration of T, he then holds the server for the remainder of the time T. 
There is no limit to the number of customers who may be waiting for service. 

a. Show that the distribution {^-(0} of the number of customers present at 
time / is given by (3.11), where H(x) is the distribution function of the 
sojourn times. 

b. Show that if the sojourn times are exponentially distributed with mean 
then so are the service times. 

c Let q be the equilibrium probability that an arriving customer does not 
receive service. Show that if the sojourn times are exponential, then 



where a=\/ \i is the offered load and P(s,a) = '2j > wms Pj- Show that 
q = 1 - a'/ a, where d is the carried load, defined by (1.9). (Can the exponentiality 
assumption be relaxed?) 

An interesting variation of the Erlang loss model is obtained with the 
additional assumption of ordered hunt. That is, we assume that the servers 
are numbered 1,2,..., s, and each arriving customer takes the lowest-num- 
bered idle server. As before, we assume that customers arrive according to 
a Poisson process, blocked customers are cleared, and the system is in 
statistical equilibrium. Again, it is not necessary to assume that service 
times are exponentially distributed. For this model we calculate the load 
carried by each server. 

Real systems that might be described by such a model include a parking 
lot in which the parking places are deemed less desirable as they get farther 
from the exit, or a telecommunications system in which the trunks can be 
financed on either a flat-rate or measured-rate (usage) basis (see Exercise 



Let N k denote the number of servers among the first k (k~ 1,2,...,j) 
ordered servers that are busy at an arbitrary instant in equilibrium. 
Because blocked customers are cleared, the values realized by the random 
variable N k do not depend on the disposition of any customer who 
overflows the first k ordered servers. Thus, for each value of the index k 
the distribution of N k is the Erlang loss distribution (3.3); that is, 



q-P(s,a)-±P(s+l,a) t 



13). 



P{N k =j} = 




(y-0,1,...,*; £=1,2,...,*), (3.13) 
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where a is the offered load. Also, it follows from (3.5) that the mean 
number E(N k ) of servers busy among the first k ordered servers is 

E(N k ) = a[\-B{k,a)} .(*-l,2,...,j). (3.14) 

To find the load carried by the yth ordered server in the A>server group, 
let us first define the random variables Xj (y = l,2,...,.s), with Xj = when 
the yth ordered server is idle and Xj = 1 when the yth ordered server is busy. 
Then E{XJ) is the mean number of busy servers in the single-server group 
that consists of only the yth ordered server; therefore, by definition E{X) is 
equal to the load carried by the yth ordered server. If we define pj ■, — P{Xj = 
1}, then E{Xj)=Pj', that is, the load carried by a single server equals the 
probability that the server is busy at an arbitrary instant (in equilibrium). 

Now, clearly, 

Nj-X x +-- +X p (3.15) 

from which it follows that 

x j = n j- n j-v (3-16) 

Taking expected values in (3.16) yields 

E(X J )^E{N J )-E(Nj_ x \ (3.17) 

which, together with Equation (3.14), implies that the load carried by the 
yth ordered server is given by 

Pj = a[ B(j -ha)- B(j, a) ] . (3. 18) 

Equation (3.18) can be interpreted to say that the load carried by the y th 
ordered server is the difference between the load that overflows the 
(y — l)th ordered server and that which overflows the yth ordered server. It 
is intuitively obvious that Pi>p 2 >P3> for a proof see Messerli [1972]. 

Exercises 

13. Suppose that a company with a private telephone network can choose from 
two classes of telephone trunks, of which the first is paid for by a flat monthly 
rental that is equivalent to $14 per hour, and the second is charged for 
according to usage at the rate of $.50 per minute. (These numbers are for 
illustrative purposes only, and were not chosen to reflect any existing rate 
structure.) If it is required that the total number of trunks should be sufficient 
to handle the offered load, estimated at 2 erlangs, with a loss no higher than 
2%, what is the most economical division between measured-rate and flat-rate 
trunks, and what is the associated cost? 
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14. Prove that in an Erlang loss system with ordered hunt the equiUbrium 
probability that an arbitrary customer will be served by they'th ordered server 
equals the ratio of the load carried by that server to the total offered load. 

15. Prove that the variance v of the Erlang loss distribution (3.3) is given by 



where a' is the total load carried by an j-server Erlang loss system and p s is 
the load carried by the last ordered server in the group. 

3.4. Poisson Input, s Servers with Exponential Service Times, 
Blocked Customers Delayed: The Erlang Delay System 

Consider now the case in which customers who find all s servers busy join 
a queue and wait as long as necessary for service, that is, blocked customers 
delayed (BCD). No server can be idle if a customer is waiting. The number 
of waiting positions in the queue is assumed to be infinite. Applications of 
this model should be self-evident. (We note for future reference that in the 
terminology of queueing theory, the ^-server BCD queue with Poisson 
input and exponential service times is called the M/M/s queue. This 
terminology will be discussed in Chapter 5.) Since customers arrive accord- 
ing to a Poisson process with rate X, and every arrival effects a change of 
system state, we have 



The assumption of exponential service times implies that if at any time all 
the j customers in the system are in service, the rate at which service 
completions occur is jp; if all s servers are busy, only those customers that 
are in service are eligible to leave, so that the service completion rate is sfi. 
Hence 



v = a'0-Ps), 



A-A = 0,1,...). 



(4.1) 




(4.2) 



Using (4.1) and (4.2) in Equation (1.1), we can write 



a J 



= 1,2,. ..,5-1) 



(4.3) 



and 




= j,j+1,...), 



(4.4) 
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where a~\/n and where P is given by 

(s-l k oo a k \-l 
2^+ 2^ (4.5) 
jfc-0 k] k-s sis* J 

If a < s, the infinite geometric sum on the right-hand side converges, and 

\k-o kl s\{\-a/s)J 

If a>5, the infinite geometric sum diverges to infinity. Then 7*0 = and 
hence P y — for all finite j. For a >s, therefore, the queue length is infinite 
(greater than any prespecified finite value) with probability 1. In this case, 
we say that no statistical-equilibrium distribution exists. 

Let C(s,a) denote the probability that all servers are occupied; C(s f a) = 
2jL s Pj. It follows from (4.4) that 

hence 



sUX-a/s) , , x 

C(s,a) — ± ^ (0<a<s). (4.8) 

£*k\ s\{\^a/s) 

Formula (4.8) is called the Erlang delay formula (because blocked 
customers are delayed until service commences) or Erlang C formula in the 
United States; in Europe it is called Erlang J s second formula and is denoted 
by E 2tS (a). Like the Erlang loss formula (3.4), the Erlang delay formula 
(4.8) was first published by Erlang in 1917 (see Brockmeyer et al. [1948]). 
Curves for (4.8) are given in Figures A.3 and A.4 of the appendix. See also 
Dietrich et al. [1966] and Descloux [1962]. 

Since we have Poisson input, the arriving customer's distribution {TIj) is 
identical to the calculated outside observer's distribution {Pj}. Thus, (4.8) 
gives both the proportion of time that all servers are busy and the 
proportion of customers who find all servers busy. 

Unlike the Erlang loss probabilities, the Erlang delay probabilities are 
not valid for an arbitrary service-time distribution. 

Again unlike the Erlang loss probabilities, the Erlang delay probabilities 
constitute a proper distribution only when the offered load is less than the 
number of servers. This restriction follows mathematically from the con- 
vergence criterion for the infinite geometric sum and does not apply if the 
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number of queue positions is finite (in which case customers finding all 

queue positions busy are cleared from the system). 

- From the definition (1.9) of carried load we can easily calculate that 

a' = a; (4.9) 

that is, the carried and offered loads are equal. This is intuitively clear, 
because in the Erlang delay model all blocked customers wait as long as 
necessary for service, and therefore all arriving customers are served. 
Indeed, (4.9) provides an intuitive partial explanation for the restriction 
a<s (or, equivalently, p< 1), for a>s would require the carried load (the 
mean number of busy servers) to exceed the number of servers, which is 
clearly impossible. 

Exercises 

16. a. Show that for every integer s >a, 

sB(s,a) 



C(s,a) = 



s-a[l-B(s,a)] ' 
and use this result to show that 

C(s,a)>B(s,a) when a>0. 
b. Show that for every integer s >a, 

C(s,a) ~ r 

l + (s-a)[aB(s-l,a)]~ l 

where B(0,a)=\. 
c Show that for every integer s >a + 1, 

C(s,a)= 1 



ts — a\ s—l — aC(s—l,a) 
\~a~) (s-l-a)C(s-l t a) 

How might these formulas be computationally useful? 

17. Review and reconsider Exercises 4 and 5 of Chapter 1. In the loss-delay 
system, what proportion of customers are denied service, what proportion 
wait for service, and what proportion are served immediately? 

Let us briefly consider the single-server case without the requirements of 
Poisson input and exponential service times. We have argued that the 
carried load equals the offered load by virtue of the assumption that all 
arrivals get served (although some may have to wait), and this is true for 
arbitrary interarrival-time and service-time distribution functions. Also, by 
the definition (1.9), in the single-server case the carried load equals the 
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proportion of time the server is busy, a' = 2jl,P y (where, in general, we 
may not know how to calculate the {Pj}). We conclude that the proportion 
of time that a single-server BCD system is busy is always equal to the 
offered load; that is, 1- P = a=\r. [As a check, note that C(\,a) = a.] 

A quantity of some interest for the single-server queue is the busy period, 
defined as the length of time from the instant the (previously idle) server is 
seized until it next becomes idle and there is no one waiting in the queue. 
The calculation of this quantity is difficult in general, and we shall 
investigate it in some detail in Chapter 5. But with the simple tools now at 
hand, we can easily calculate the mean busy period for systems with 
Poisson input (and arbitrary distribution of service times). Denote by b the 
mean length of the busy period. Then the ratio of the mean busy period to 
the total cycle time (contiguous idle period and busy period) is b/(X~ l + 
b), where A -1 is the mean interarrival time. But this ratio, which is 
independent of the form of the service-time distribution function, is simply 
the proportion of time that the server is busy; therefore, 



Since we are assuming that all arrivals ultimately get served, it follows that 

l~P =a=Xr. (4.11) 

We conclude from (4.10) and (4.11) that the mean busy period b is given 
by 

b= rh (o <«<*). ( 4 - 12 ) 

1 — a 



Observe that b increases rapidly as a approaches unity. Thus, for example, 
when the server occupancy is 90%, the mean busy period is 10 service 
times; when the server occupancy is 99%. the mean busy period is 100 
service times. 



Exercise 

18. Is the analysis leading to (4.12) valid for arrival processes other than the 
Poisson? Why? 

We now turn to consideration of the Erlang delay system with ordered 
hunt: The servers are numbered 1,2,..., s, and each arriving customer 
takes the lowest-numbered idle server. Blocked customers wait until 
served, and no customers can be waiting in the queue if there is an idle 
server. As usual, we assume Poisson input, exponential service times, and 
statistical equilibrium. 
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Recall that we previously considered the Erlang loss system with orderec 
hunt, and we obtained Equation (3.18) for the probability pj that the y'th 
ordered server is busy at an arbitrary instant in equilibrium. (Equivalently, 
pj is the load carried by the yth ordered server.) The derivation of (3.18) 
depended critically on the assumption of blocked customers cleared. Even 
though we are no longer making this assumption, it turns out, surprisingly, 
that we can nevertheless solve the corresponding problem using only 
elementary considerations. (However, in contrast with the Erlang loss 
model, we must now require that service times be exponentially distrib- 
uted.) 

To calculate the probability Pj that the yth ordered server is busy (under 
the assumptions of the Erlang delay model) we partition time into two 
mutually exclusive and exhaustive sets of intervals, namely, those time 
intervals during which there is at least one customer waiting and those 
time intervals during which there are no customers waiting. 

We let JV be the total number of customers in the system at an arbitrary 
time (at which the system is assumed to be in statistical equilibrium) and 
as before, define Xj = when the jth ordered server is idle and X J ■ — 1 when 
it is busy. Then, from the theorem of total probability we can write 

Pj = P{Xj=l\N<s}P{N<s}+P{Xj=l\N>s}P{N>s} 

0=1,2,...,*). (4.13) 

If no server can be idle when customers are waiting, then clearly P{Xj = 
l\N >s} = 1. We will now argue that, remarkably, 

P{Xj=\\N<s}=Pj (4.14) 

where p p given by (3.18), is the load carried by the yth ordered server in an 
Erlang loss system. 

To see the truth of (4.14), observe that because of the Markov property 
the behavior of the Erlang delay system during the time intervals when 
N<s is unaffected by the behavior of the system when N >s; therefore, 
the load carried by the yth ordered server during the time intervals when 
no customers are waiting is the same as that for the corresponding Erlang 
loss system. This clever intuitive argument was first advanced by Vaulot 
[1925], We shall give a generalization and a more rigorous proof in Chap- 
ter 4. 

Now, it follows easily from (4.4) that =(a/s) C(s, a); hence 

P{N>s} = ^C(s,a). (4.15) 

Therefore, we conclude from Equations (4.13)— (4.15) and (3.18) that the 
load pj carried by the yth ordered server in an j'-server Erlang delay system 
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with ordered hunt is given by 

Pj = a[ B(j -\,a)-B{j,a)][\- P C(s, a) ] + P C(s, a), (4. 1 6) 
where p = a/s is the average load carried per server (the server occupancy). 

Exercises 

19. Consider again the premise of Exercise 13. Now, however, assume that (a) 
sufficient waiting positions are provided so that no calls will be lost and (b) 
the total number of trunks in the group is 4. What is the most economical 
division between measured-rate and flat-rate trunks, and what is the 
associated hourly cost? 

20. Prove that for an j-server Erlang delay system, the statistical-equihbrium 
probability that j customers are in service, given that j <s, is the same as the 
corresponding unconditional probability for the Erlang loss system. Gener- 
alize this theorem so that it applies to general birth-and-death queueing 
models. 

21. Reconsider Exercise 14 with "Erlang loss system" replaced by "Erlang delay 
system." 

We now turn to the analysis of waiting times in the equilibrium Erlang 
delay system in which the queue discipline is service in order of arrival. In 
particular, we will calculate, for each t>0, the probability P{W>t) that 
an arbitrary customer will wait in excess of t before entering service. To 
this end, let us write 

P{W>t} = P{W>0}P{W>t\W>0}. (4.17) 

We shall call a queue discipline nonbiased* if it is such that whenever a 
customer is selected from the queue for service, his service time is a 
random variable with the same distribution function as an arbitrary service 
time. For example, nonbiased queue disciplines include service in order of 
arrival (first come, first served), service in random order, and service in 
reverse order of arrival (last come, first served). An example of a queue 
discipline that is not nonbiased is one sometimes called shortest-process- 
ing-time-first, in which the customer selected for service when a server 
becomes available is that customer whose service time is least among all 
those waiting (where, of course, it is assumed that these values can be 
ascertained). 



*In the first edition, we used the word nonscheduhd instead of nonbiased; the latter seems to 
be a more descriptive term. 
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If two queueing models are the same except for their queue disciplines, 
and if their queue disciplines are nonbiased, then they will have the same 
distribution of state probabilities. Hence, all other things being equal, the 
probability P{W>0} is the same regardless of the (nonbiased) queue 
discipline. In particular, for the Erlang delay system in equilibrium we 
have, for any nonbiased queue discipline, from (4.7), 

It follows that to find P{W>t) for the Erlang delay system with order- 
of-arrival service, it is sufficient to calculate the conditional probability 
P{W>t\W>0}. 

Let Q be the number of customers waiting for service at the instant of 
arrival of an arbitrary customer (the test customer). Then, it follows from 
the theorem of total probability that (for any queueing system) 

oo 

P{W>t\W>0}= 2 P{W>t\W>Q y Q=j}P{Q=j\W>0}. (4.19) 

j-0 

Recall from Equation (5.55) of Chapter 2 that for exponential service 
times, 

P{W>t\W>0, Q=j}= £ i^-e~^. (4.20) 
Now, from the definition of conditional probability we can write 

Note that although Q and W are defined from the viewpoint of the 
arriving customer, we can (because the input process is Poisson) adopt the 
viewpoint of the outside observer. Thus, 

P{Q-j t ^>0}-^ + ,-ff (f (4.22) 

where the second equality follows from (4.4). Substitution of (4.22) and 
(4.18) into (4.21) yields the geometric distribution 

P{Q=j\W>0}=(\-p)f>> (y = 0,l,...), (4.23) 

where p = a/s is the server occupancy. Now substitute (4.20) and (4.23) 
into (4.19) and compare with Equations (5.57) and (5.58) of Chapter 2. We 
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conclude that 

P{W>t\W>0} = e-V- p)stU ; (4.24) 

that is, for the equilibrium Erlang delay system with service in order of 
arrival, the conditional (given that the customer is not served immediately) 
waiting-time distribution function is the negative-exponential. 

To summarize, we have argued that (1) the waiting time for an arbitrary 
customer who finds all servers busy is the sum X x + • • • + X J+ x of the times 
that he and each of the j customers ahead of him spend at the head of the 
queue, that these times are independent exponential random variables with 
mean (jju) -1 , and hence their sum has the (_/' + l)-phase Erlangian distribu- 
tion (4.20). This is the analysis leading to Equation (5.55) of Chapter 2. 
The next step in our argument is that (2) the number of customers that the 
(assumed blocked) test customer finds ahead of him in the queue has the 
geometric distribution (4.23). Thus, the test customer's waiting time is a 
sum of independent, identical, exponential random variables, where the 
number of variables in the sum is geometric; therefore, as shown by 
Equation (5.58) of Chapter 2, the test customer's waiting time is exponen- 
tially distributed. 

Equation (4.24) is particularly suited to graphical presentation; for each 
value of p, Equation (4.24) gives a straight line when P{ W >t\ W>0} is 
plotted against sfit on semilog paper, as in Figure A.5 of Appendix A. The 
unconditional waiting- time probability P{W>t} is easily obtained from 
Equations (4.17), (4.18), and (4.24): 

P{W>t} = C(s,a)e-V- p)sflt . (4.25) 

Numerically, we can easily calculate P{W>t) by evaluating the factor 
e -(i-p)s/*» f rom graph in Appendix A, evaluating C(s,a) from its graph 
in Appendix A or from one of the tabulations of the Erlang delay formula, 
and forming the required product. 

Since the conditional waiting time for blocked customers is exponen- 
tially distributed according to (4.24), it follows that the conditional mean 
waiting time E{W\ W>0) (the mean wait suffered by those customers who 
are blocked) is given by 

E(W\W>0)= 1 , (4.26) 
(l-p)s/t 

and thus the overall mean waiting time E( W) is given by 

These formulas are tabulated or plotted in Dietrich et al. [1966] and 
Descloux [1962]. 
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It is an important fact, which we shall prove later, that the formulas 
(4.26) and (4.27) are valid for an Erlang delay system with any nonbiased 
queue discipline, even though the derivation given here assumes service in 
order of arrival. 

Finally, a note on terminology: Some queueing theorists call the waiting 
time that would be experienced by an outside observer, were he in fact to 
join the system as a customer, the virtual waiting time; and they call the 
waiting time experienced by an arriving customer the actual waiting time. 
We have shown that the arriving customer's distribution {Hj} and the 
outside observer's distribution {Pj} are equal for systems with Poisson 
arrivals, from which it follows that the actual waiting times and the virtual 
waiting times have the same distribution for systems with Poisson arrivals. 
In particular, it follows that Equation (4.25) describes both actual and 
virtual waiting times in the equilibrium M/ M/s queue. 

Exercises 

22. Repeat Exercise 5 with "Erlang loss system" replaced by "Erlang delay 
system with order-of-arrival service," and with the additional requirement that 
the entrepreneur must pay $10 for each customer whose wait for service 
exceeds ^ hour. If the entrepreneur has control over the order of selection 
from the queue, should he select in order of arrival? 

23. Consider a 10-server Erlang delay system that handles an offered load of 6 
erlangs. If the arrival rate increases by what is the corresponding increase 
in 

a. the fraction of customers who must wait for service, and 

b. the mean waiting time for those customers who must wait for service? 
Repeat the exercise for the case where, instead, the mean service time 
increases by a factor of j. 

24. In an Erlang delay system with service in order of arrival, what proportion of 
those customers who are blocked wait longer than the average waiting time 
for blocked customers? 

25. Consider a telephone system in which the central office equipment that 
provides dial tone (the dial-tone machines) can be modeled as an Erlang delay 
system with service in order of arrival. (That is, the dial-tone machines are the 
servers, and a subscriber who goes "off hook" to place a call is a customer.) If 
a subscriber goes off hook and still has not received dial tone after 30 
seconds, what should she do? 

26. Show that in the Erlang delay system with order-of-arrival service, the 
customer waiting-time distribution function has variance 
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27. Let W be the waiting time and T the sojourn time (waiting time plus service 
time) of an arbitrary customer. Show that in the single-server Erlang delay 
system with service in order of arrival, 

P{ T >t} = P { W >t\ W >0}. 

28. a. Consider an Erlang delay system, and denote by L the mean queue length, 

W the mean waiting time for service, and A the arrival rate. Verify the 
equation L = \W. (This well-known equation, often called Little's theorem, 
is true under very general conditions and is often written using the present 
symbols. See Section 5.2.) 
b. Redefine L to be the mean number of customers in the system (waiting and 
in service) and W to be the mean time spent in the system. (W is the mean 
sojourn time.) Verify L=XW using these redefinitions. 

29. Prove that in an Erlang delay system with order-of-arrival service, the 
equilibrium conditional probability that a blocked customer will still be 
waiting in the queue when the next customer arrives is equal to the server 
occupancy or utilization factor p. (This is a special case of a more general 
result. See Exercise 33 of Chapter 5.) 

30. Let N be the number of customers found by an arrival at an j-server Erlang 
delay system. Using the fact that the events {Q=j, W>0) and {N—s+J} 
are equivalent, show that Equation (4.19) can be written (for any nonbiased 
queue discipline) 

P{W>t\W>Q}={\-p) £ p J P{W>t\N=s+j], 

j-0 

where p is the server occupancy. 
1. Consider the differential-difference equations 

[t>0; j = 0,l,...; F_,(/) = 0], (1) 
where c is an arbitrary constant. Define 

2 Fj(')x J - (2) 

j-0 

a. Show that F(x,t) satisfies 

■^F(x,t)-c(x-l)F(x,t), (3) 

which has solution 

F(x,t) = F(x,0)e- (l ~ x)c '. (4) 
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b. Let N(t) be the number of events occurring in (0,/) according to a Poisson 
process with rate A. Then, according to Equation (2.5) of Chapter 2, we can 
set Fj(t)=P{N(t)~j} = Pj(t) and c=X in Equation (1). Show that Equa- 
tion (4) yields the Poisson distribution, 

/>.(,)= Qjfe-* = 0,1,...). 

c. In an Erlang delay system with s servers, mean service time fi~\ and 
service in order of arrival, let N be the number of customers waiting or in 
service when an arbitrary customer (the test customer) arrives, let W be the 
test customer's waiting time, and define Wj(t)**P{ W>t\N=s+j). Show 
that 

Wj{h + t) = Sf ihWj- .(O + il-snh) Wj(t) + o(h) (h^O), 
and therefore, 

A-WjW-spWj.M-siiWjit) 
[t>0; 7 = 0,1,...; ^_,(0=0]. (5) 

d. Define 

W(x,t)= § Wj(t)x J t (6) 

j = 

and show by comparison with Equation (4) that 

W(x,t)= W(x,G)e^ l - x W. (7) 

e. Show that 

W(x,0)= J ^ c . (8) 

f. Show that the result of Exercise 30 implies 

P { W >t\W >0} = (1 -p) W(p,t). (9) 

g. Deduce from Equations (7)-(9) that 

P{W>t\W>0} = e- il - p) ^. (10) 

Compare with Equation (4.24). 

h. Show directly from Equations (6)-(8) that 

£ ^-e-*». (11) 
Compare Equation (11) with Equation (4.20). 
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32. Service in random order. Consider the equilibrium ^-server Erlang delay sys- 
tem with service in random order. Let N be the number of customers wait- 
ing or in service when an arbitrary customer (the test customer) arrives, let W 
be the test customer's waiting time, and define W/J) = P{ W>t\N-s+j). 
a. Show that 

Wjih + t^teWj^ + j^svhWj^t) 

+ [l-(X + sn)h]Wj(t) + o(h) 
[h^O; y = 0,l...; ^_,(0=0], 



(1) 



and therefore, 
d 



[y-<U...; w_i(0-0], 



(2) 



where 



^(0)=i o=o,i,...). 

b. Define 

w j' ) -(-7F w w) i _ O-* 1 --; *-<u....; ^j 0) =l), (3) 

and assume that Wj(f) has the Maclaurin series representation 

Wj (t)= f 0=o,i,...). 

Using the result of Exercise 30, show that 

P{W>t\W>0} = \ + (\-p) pM - 

c Show that the right-hand side of (5) can be evaluated to give 

P{W>t\W>0) ^\-SLU i —^ln—^- 

p l-p 



(4) 



(5) 



+ ^(1-P) 



2-l^ln 1 



l-p 



- + 



(6) 



For further discussion of this method see Riordan [1953]. Pollaczek [1946] 
gives a closed-form solution for the probability P{ W > t\ W > 0}. 
Summaries of Pollaczek's and other studies are given in Riordan [1962] and 
Syski [I960]. 
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33. Let B{t) be the distribution function of the busy period in the single-server 
Erlang delay system, with mean b = ffi dB(t); and let W be the equilibrium 
waiting time of an arbitrary customer. Convince yourself that when service is 
in reverse order of arrival, then P{ W<t\W>0) = B(t). Conclude, therefore, 
that E(W\W>0) = b, where b is given by (4.12). Note that this agrees with 
(4.26) when s*=l, thereby providing an illustration of the theorem that the 
mean waiting time does not depend on the (nonbiased) order of service. 



3.5. Quasirandom Input 

In all the birth-and-death queueing models considered so far, we have 
assumed that requests for service occur according to a Poisson process 
with rate X. With Poisson input, the probability that a request for service 
will occur in any interval (t, t + h) is independent of the state of the system 
at time This important property is at the heart of the proof in Section 3.2 
of the equality of the outside observer's distribution {Pj} and the arriving 
customer's distribution {IZ,} for systems with Poisson input. 

Consider now a system in which the requests for service are generated 
by a finite number of sources. In such a system the probability of an 
arrival in an interval (t, t + h) will not be independent of the system state 
at time t, since it will, in general, depend directly on the number of sources 
idle (and therefore available to generate new requests) at time t. Conse- 
quently, the outside observer's and the arriving customer's distributions are 
not equal. 

These properties are illustrated by the finite-source system with an equal 
number of sources and servers. When all the servers are occupied no new 
requests can occur, since there are no idle sources to generate new 
requests. Therefore the probability of blocking is zero, whereas the propor- 
tion of time all servers are busy can take any value between zero and one. 

The particular kind of finite-source input we shall consider is often 
called quasirandom input (as opposed to Poisson or completely random 
input). We say that a finite number n of (identical) sources generate 
quasirandom input if the probability that any particular source generates a 
request for service in any interval (t, t + h)isyh + o(h) as h^O if the source 
is idle at time and zero if the source is not idle (waiting or being served) 
at time t, independently of the states of any other sources. 

It follows from this definition that if a particular source is idle at time t, 
the distribution of time from t until the source next generates a request for 
service is exponential with mean y" 1 ; that is, with probability e yx the 
source will not originate a request for service in (t, t + x). 

If the number of sources idle at time t is n, the probability that exactly j 
of them generate requests for service in (t, t + x) is 
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Now let >c© and y-»0 in such a way that ny = \. Then, it can be shown 
that 



Thus, assuming that each idle source can generate at most one request in 
(/, t + x% we have shown that the distribution of the number of requests 
generated is, in the limit, Poisson. It should be intuitively clear that 
because y^O the restriction of at most one request per source can be 
removed without affecting the conclusion. Then, more formally, we con- 
clude: 

If the number n of independent sources generating quasirandom input 
increases to infinity and the request rate y per idle source decreases to zero 
in such a way that the overall idle-source request rate ny remains constant 
(ny=A), then in the limit the input process is Poisson with rate A. 

In the case of Poisson input the number of sources, or potential 
customers, is infinite (and is therefore often called infinite- source input). 
However, the sources affect the system only when they make requests for 
service, that is, when they become customers. Therefore, with Poisson 
input, there is no need to distinguish between sources and customers. This 
is not true, however, in the case of finite-source input. With finite-source 
input, a source assumes the role of a customer when it places a request for 
service, and it remains a customer as long as it is either waiting for service 
or in service. As soon as it becomes eligible to generate a new request for 
service, it again assumes the role of a source. We shall refer to a source as 
a customer only when we wish to stress that it is making a request for 
service. Any source can become a customer more than once, so the 
number of blocked customers is the number of requests (generated by 
sources) that are blocked. To avoid this confusion, we shall say that a 
source generates a request rather than that a source becomes a customer. 
The meanings of the words customer, source, and request should be clear 
from the context in which they are used. 

Let us consider a system with n sources, s servers, exponential service 
times, and quasirandom input, where each idle source generates requests 
for service at rate y. If at any time t the number of idle sources is n — y, 
then the probability that a request for service will occur in (/, t + h) is 
(n-j)yh + o(h) as A-»0. Thus queueing models with quasirandom input 
and exponential service times can be studied in the framework of the 
birth-and-death process, in a manner analogous to that used for the 
corresponding systems with Poisson input; we simply have a different 
specification of the birth coefficients {AJ. 

As with the infinite-source models considered previously, it is often 
useful to think in terms of carried and offered loads. In Section 3.1 we 




Y-*0 
ny ~A 



(5.1) 
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defined the carried load a', for any input process, by Equation (1.9). We 
can also retain the definition of offered load a, given by Equation (1.11) as 
the product of the arrival rate (the mean number of requests per unit time) 
A and the mean service time t. With Poisson input, the arrival rate A is 
independent of the system to which it is offered and thus provides a simple 
characterization of the amount of traffic that is to be handled by any 
system that may be provided to handle it. The same is not true of 
quasirandom input, because the instantaneous arrival rate A^ depends on 
the state of the system, and the proportion of time the system spends in 
each state depends on the number of servers and the disposition of blocked 
customers. Therefore, although the traffic can be characterized by the 
number of sources n and the idle request y per source, independently of 
the system to which it is offered, the mean number of requests per unit 
time A (and hence also the offered load a) that a system receives depends 
directly on the characteristics of the particular system that receives it. 

For each of the quasirandom input models that we consider, we shall 
derive a relationship between the offered load per idle source d=y/p and 
the total offered load a. Unfortunately these relationships involve state 
probabilities that depend on the number of servers. For this reason, it is 
sometimes useful to consider the intended offered load a*, defined as the 
load that the sources would offer to the system if there were enough servers 
so that no blocking could occur (that is, s = n). Then, as we shall show 
later, the intended offered load a* is given, for any H-source system, by 



where d = y/[i is the offered load per idle source. 

Finally, let us note for future reference that for systems in which the 
queue discipline is blocked customers cleared or blocked customers de- 
layed, the carried load a' can be calculated from the formula 



To prove (5.3), we sum both sides of the basic "rate up = rate down' 
equations (1.2), where we take 



a 




(5.3) 




(5.4) 



[The death rates defined by (5.4) describe both the delay model and the 
loss model. In the latter case, the values for j>s are irrelevant because 
A^O, which implies that Pj = for j>s.] Using the fact that, for n-source 
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systems with quasirandom input, A„ =0 and Pj = for j >n, we obtain 

"2 h p j = M 2 V, + J" 2 ^ (5-5) 
Comparison of (5.5) with (1.9) proves (5.3). 

3.6. Equality of the Arriving Customer's a? -Source Distribution 
and the Outside Observer's (n — l)-Source Distribution for 
Birth-and-Death Systems with Quasirandom Input 

We have already shown by example that in systems with finite-source 
input, in contrast to the infinite-source case, the outside observer's distri- 
bution {Pj} and the arriving customer's distribution { IIj} are, in general, 
unequal. In the case of the birth-and-death queueing models with quasi- 
random input, we can calculate {Pj} directly from the birth-and-death 
probabilities (1.1). But { IIj} is the distribution of direct relevance to one 
concerned with the quality of service provided by the servers to the 
customers. Hopefully, the two distributions will be simply related. Happily, 
they are. 

For example, consider the simple system composed of one source and 
one server. Suppose that the source generates requests at rate y when idle 
and rate zero when not idle, and let the mean service time be t. Let yr = a; 
a is the load offered by the source when idle. The server alternates between 
busy and idle states, with each busy period of mean length t. Consider a 
single cycle composed of an idle period and an adjacent busy period. The 
cycle has mean length y ~ 1 + t, and thus the ratio of the mean busy period 
to the mean cycle length is 

t _ a 

y _1 + T 1 + 

As we did with a similar example for Poisson input, we interpret this ratio 
as the proportion of time (or probability) that the server is busy when the 
system is in equilibrium. This ratio is not the proportion of requests 
blocked, which we have already argued is zero when (as in this example) 
the number of sources does not exceed the number of servers. Let Pj[n] 
and IIj[n] be the w-source outside observer's and arriving customer's state 
probabilities, respectively. We then have P l [l] = d/(\ +d) and Pdl] = 
1/(1 + a) for the outside observer's distribution, in contrast with the 
arriving customer's distribution, iT 1 [l] = and 2I [1] = 1. 

Consider now the same system with the addition of an identical source. 
Since blocking can occur in this two-source, one-server system, we must 
consider the disposition of blocked requests. Let us assume that blocked 
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customers are cleared. (That is, any source that finds all servers busy when 
placing a request returns immediately to its previous state as an idle 
source.) The server again alternates between busy and idle states. The busy 
period still has mean length t. But since there are now two sources, each 
bidding for service at rate y when idle, the overall bid rate when both 
sources are idle is 2y, and the idle portion of the cycle has mean length 
(2y) _I . Forming the ratio of the mean busy period to the mean cycle 
length, we obtain 

r _ 2d 
(2y)"' + t ~ l+2a ' 
Thus the outside observer's distribution is P 1 [2] = 2«/(l +2a) and P<p.] = 
1/(1 + 2a). 

Blocking can occur only when the server is occupied by one of the 
sources. The number of requests blocked during a cycle is the number 
generated by the idle source during the service time of the busy source. 
Since we have assumed blocked customers cleared, the mean number of 
requests generated by a source during the service time of the other source 
is yT = a . On the other hand, the total number of requests generated during 
a cycle is simply the sum of the number generated during the busy period 
plus the number generated during the idle period. The latter number is 1, 
namely, the arrival who finds the server idle, ending the idle period and 
starting the busy period. Forming the ratio of the mean number of requests 
blocked to the mean number generated per cycle, we obtain d/(\ + d). 
That is, U,[2] = a/(l + a) and JT [2] = 1/(1 + a). 

Comparison with our calculations for the one-source case shows that 
JT [2] = PJl] and J7 1 [2] = i > ,[l]. In other words, the state distribution seen 
by a source in the two-source system when placing a request is the same as 
the state distribution the source would see if it placed no requests but 
instead acted the part of an outside observer of the one-source system. 

It is true in general that for equilibrium «-source birth-and-death sys- 
tems with quasirandom input, the state distribution seen by a source when 
placing a request is the same as if that source were not contributing load to 
the servers, but instead were only observing the corresponding (n — 1)- 
source system continuously or at random instants. For many years the 
(« — l)-source outside observer's viewpoint was considered only an ap- 
proximation to the H-source arriving customer's viewpoint, since (it was 
argued) no account is taken of the fact that the particular source places a 
load on the system, interacting with other sources, and thus affecting the 
state distribution. This argument neglects to mention that the requests 
generated by any particular source do not follow a Poisson process (why?); 
apparently these two effects cancel each other exactly. 

We now give a formal proof of the general theorem: In any equilibrium 
w-source birth-and-death queueing system with quasirandom input, the 
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arriving customer's distribution is the same as the outside observer's 
distribution for the corresponding (n — l)-source system. 

We consider a system with n sources, each source originating requests at 
rate y when idle and rate otherwise (quasirandom input). Then the 
request rate when j sources are busy (in service or waiting for service) is 

\ = (n-j)y = 0,1,. ..,«). (6.1) 

To calculate the arriving customer's distribution {JJj}, substitute (6.1) into 
(2.6); this yields 

(n-j)P, 

Uy J . (6.2) 



In order to emphasize the dependence on the number n of sources, we 
write Pj = Pj[n] and FIj = Tl^n]. Then (6.2) becomes 

n A»] = „-\ = 0, 1,...,„- 1). (6.3) 

2 (n-k)P k [n] 

k = 

Now, according to Equations (1.1) and (6.1), the outside observer's distri- 
bution can be written 

W- *"- 1 ^? 1 " '™ = ..2 .> (.4) 



and 



^■-(-l. - ( -'^^ t " T T ' <-> 



Substitution of (6.4) into (6.3) yields 

n(n- !)•••(« -j +\)( n -j)yJ p 
nj [n] = — 



k=*i Pith' " Pk 



0=1,2,. ..,«-!). (6.6) 
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After cancellation of the factor nPJ[n] in (6.6), we have 
(n-\)-(n-j)yj 

JLlj jLU ' ' ' fl- 

■ - 0"-U,...,»-l). (6.7) 

J x , (*- !)-••(/»-*)?* 

Comparison of Equation (6.7) with (6.4) and (6.5) shows that J2,[«]== 
Pj[n — 1] for y= 1,2,...,« — 1. Since we must have 

"f^["- l ] = ^["] = l. 
y-o y-o 

we conclude that 

nj[n] = Pj[n~\] (y = 0,l,...,«-l). (6.8) 

The theorem is proved. For further discussion, see Exercises 1 and 5 of 
Chapter 4 and Exercise 1 of Chapter 5. 

Taking limits through (6.8) as /i-»oo and y->0 while holding «y = \, the 
dependence on n vanishes and, since quasirandom input becomes Poisson 
input in the limit, we again illustrate the equality of the arriving customer's 
distribution and the outside observer's distribution for systems with Pois- 
son input. 



3.7. Quasirandom Input, s Servers, Blocked Customers Cleared: 
The Engset Formula 

We assume that the number of sources is n>s and that each source 
independently generates requests at rate y when idle and rate zero other- 
wise. Since blocked customers are assumed cleared, a new request will 
effect a change of state only when at least one server is idle. Hence, we 
have birth rates 

^ / («-/')? = 0, 1,..., j- 1), ( n 

"no o=,). (7 - 1} 

Let us assume for the present that service times are exponential. Then we 
have death rates 

ty=M 0'=1,2,...,j), (7.2) 

where jti" 1 is the mean service time. As with the Erlang loss model, the 
assumption of exponential service times can be shown to be superfluous. 
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The rates (7.1) and (7.2), in conjunction with (1.1), give for the statisti- 
cal-equilibrium probability of j busy servers (where d — y/ii) 

p A"] = ~ r~~ — 0= 0, 1 s) (7.3) 

fc = X K/ 

and Pj[n] = Q for j>s. If the substitution 

a-^- p (7.4) 

is made in (7.3), then, after multiplication of numerator and denominator 
by (1 -pf, Equation (7.3) takes the form of the truncated binomial distribu- 
tion 

(")p J (l-p) n - J 
P A n ] ~ 00,1,...,*). (7.5) 



fc = 

Solving (7.4) for p gives 

which is immediately recognized as PJ1], In other words,/? is the probabil- 
ity that an arbitrary source would be busy if there were no interaction 
among the sources. If n = s, in which case each source assumes busy and 
idle states unaffected by the states of the other sources, then the denomina- 
tor on the right-hand side of (7.5) sums to unity, and the distribution of the 
number of customers in service is the simple binomial. In this case the 
mean number of busy servers is np; thus, the intended offered load a*, 
defined as the load that would be offered (and carried) if s = «, is given by 
(5.2), as asserted. 

Expression (7.3) — or (7.5) — gives the proportion of time the system 
spends in each state E r The probability IIj[n] that one of the n sources 
finds the system in state Ej when placing a request is obtained by replacing 
n by n — 1 in (7.3): 

("T'W 

m«] . , , , = 0,1,. ..,*). (7.7) 

-l i -it 
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In particular, the proportion of requests that find all s servers busy is 
II s [n]; the right-hand side of (7.7) with j = s is often called the Engset 
formula. 



Exercise 
34. Show that 

lim 7I[n] = i > / , 

n— »oo J J 
Y-»0 
ny — X 

where IIj[n] is given by (7.7) and P } is given by (3.3). 

To calculate the carried load a' we use Equation (5.3). Substitution of 
(7.3) into (5.3) gives, after some calculation, 

«' = a*[l-(l-^[ W ]], (7.8) 

where a* is the intended offered load, defined by (5.2). In view of the fact 
that the carried load a' is that portion of the offered load a that is not 
cleared, we have 

a' = a{\-n s [n}). (7.9) 

Equations (7.8) and (7.9) give the following relationship between the 
offered load per idle source a = y/ju and the total offered load a: 

a(l-n s [n]) 

d= n(l-P s [n])-a(l-n s [n]) + sP s [n] ' ^ 

Thus we see that the offered load generated by a finite number of sources 
cannot be specified without first calculating its effect on the particular 
system to which it is offered. 

Recall from Section 3.5 that we defined the intended offered load a* as 
the load that the sources would offer to the system if there were as many 
servers as sources (in which case no blocking would occur, and the carried 
and offered loads would be equal). Clearly, when no blocking occurs the 
queue discipline is irrelevant; therefore, we can make the calculation for 
the model at hand (blocked customers cleared) and apply it to all systems 
that differ from the present one only in the queue discipline. Thus, we have 
established (5.2) in general. 

The point to be made here is simply that the offered load a is a most 
useful and simple measure of demand for Poisson traffic, but not nearly so 
useful or simple for quasirandom traffic. When blocked customers are 
cleared, the offered load a is bounded from below by the intended offered 
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load a* (why?); thus Equation (5.2) provides a low-side estimate of the 
offered load when the probability of blocking is small. 

The results of this section were calculated here on the assumption 
of exponential service times. But, as with Poisson input, when blocked 
customers are cleared the birth-and-death equations (n — j)yPj = 
(j+l)pP J+l (j = 0,l,...,s—l) are valid for any service-time distribution 
function with finite mean p~ l ; and hence (7.7) also remains valid. (Note 
that in the heuristic discussion of single-server systems with one and two 
sources, no assumption was made about the form of the service-time 
distribution function.) Cohen [1957] (see also pp. 278-284 of Syski [I960]) 
has derived a "generalized Engset formula," which shows that for finite- 
source BCC systems with nonidentical sources, the state probabilities 
depend only on the mean service times and mean idle-source interarrival 
times for each source, and not on the forms of their respective distribution 
functions. 

Exercise 

35. Four sources share access to two servers. Assume that (1) the input generated 
by the sources can be described as quasirandom, with each source char- 
acterized by a mean time between requests when idle of y ~ 1 = 27 minutes and 
a mean service time ju -1 =3 minutes, and (2) blocked customers are cleared. If 
an additional identical source is given access to the servers, by what per- 
centage will the probability of blocking increase? If a statistician were 
observing the system and tabulating the total number of requests for service 
per hour generated by the sources, what values would he obtain (when n = 4 
and n = 5)? 

3.8. Quasirandom Input, s Servers with Exponential Service Times, 
Blocked Customers Delayed 

We now turn to the finite-source analogue of the Erlang delay model: We 
assume that n sources generate quasirandom input to a group of s ex- 
ponential servers, and that blocked customers wait as long as necessary for 
service. This model has found wide application. For example, Scherr [1967] 
has used it to predict the performance of a computer system composed of a 
central processing unit (server) and a number of teletypewriter terminals 
(sources), using either batch processing or time sharing. Teletraffic en- 
gineers have used it to describe systems in which a small number of lines 
(sources) in a central office have access to a common outgoing trunk group 
(servers). Another important example is provided by the so-called machine 
interference or repairman models. Here the sources are machines that can 
break down, thereby requiring the attention of a repairman (server). 
References, numerical examples, and related models are given in Cox and 
Smith [1961], Feller [1968], and Page [1972]. 
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In this model we have quasirandom input, where every request for 
service effects a change of state; hence 

\-(»-y)y U=o, !,...,»), (8.1) 



where n is the number of sources and y is the request rate for an idle 
source. The rate at which busy sources become idle is 

p.-!-** 1 0'=1>2,...,j), 

J {Sfl (j = S+l, 5 + 2,. ..,«), 



(8.2) 



where ju. - 1 is the mean service time. 

With d = y/n the rates (8.1) and (8.2), in conjunction with Equation 
(1.1), give for the statistical-equilibrium probability of j customers present 



0=1,2,. ..,^-1), 



(n-jy.sls' 
where PJn] is given by 



: — d J P \n] = 5,5+1,. ..,«), 



(8.3) 



P ["] 



s-l 
k 



y( n )a*+ 2 a 



- 1 



(8.4) 



Note that, in contrast with the infinite-source Erlang delay model, the 
number of terms in the second summation on the right-hand side of (8.4) is 
finite. Therefore the equilibrium-state distribution (8.3) and (8.4) is nonde- 
generate for all values of the parameters y and |Lt. Thatis, since the number 
of sources n is finite, the number of customers waiting for service can 
never exceed the finite value n — s, so that ever-increasing queue lengths 
cannot occur. In effect, finite-source systems are self-regulating, since the 
request rate gets smaller as the system gets busier, and the stream of 
requests shuts off completely when there are no idle sources available to 
generate new requests. 

The probability P{W>0} that a customer must wait for service is the 
probability that a source finds all servers busy when placing a request. 
Therefore 

P{W>0}= 2* /?,[«]= "s < 8 - 5 ) 
To calculate the waiting-time distribution function, we can write 

n — s— I 

P{ W>t }= 2 P{W>t\N = s+j}P{N*=s+j], (8.6) 

7 = 

where N is the number of customers present at (just prior to) an arrival epoch. 
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P{N=s+j} is the probability that a source finds all s servers busy and j 
other sources waiting for service when it places a request for service. Hence 

P{ N=s +j)=n s+j [n] = P s+J [n-\}. (8.7) 

If requests are served in order of arrival, the conditional probability 
P { W > t\N= s +j] that a customer waits beyond t when placing a request, 
given that he finds s+j other customers present, is exactly the same in a 
finite-source system as in its infinite-source counterpart; the waiting time 
of a customer who finds all s servers busy and j other waiting customers 
ahead of him is the sum of the lengths of time that each of the j + 1 
customers spends at the head of the queue. Since these time intervals are 
independent exponential variables with common mean (j/x)~ l t we have [see 
Equation (4.20)] 

P{W>t\N = s+j}=e-**2 (8.8) 



By the same argument 



E(W\N=s+j)= J -±±. (8.9) 



Substituting (8.7) and (8.8) into (8.6) we obtain (see Exercise 36) 

P{W>t} = c 2 ^^r-e-*^, (8.10) 
j=o J- 

where 

<P(t)=-^+SfLt (8.11) 

and 

c= n 4n] (^(|p->, (8 , 2) 

Observe that the distribution of W is essentially Erlangian of order n — s. 

The order-of -arrival waiting-time distribution function (8.10) and some 
related quantities have been tabulated by Descloux [1962], who also gives a 
concise discussion covering much of the same material presented here. 
Tabulations are also available in Peck and Hazelwood [1958]. 
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Exercise 

36. Verify Equation (8.10). 

Using (8.9), we can directly calculate the mean wait for service (for 
arbitrary order of service) suffered by an arbitrary customer when placing 
a request: 

E( W)= " 2 E(W\N=s+j)P{N=s+j). (8.13) 
J-o 

Although (8.13) may be computationally useful, unfortunately it does not 
yield algebraically neat results, and we shall not pursue it further. 

As was true in the BCC case, the offered load a cannot be specified in 
the BCD case without first calculating its effect on the particular system to 
which it is offered. With BCD, the right-hand side of Equation (5.3) 
becomes 

Yi\ P A"] = 4"-tjPj[n]\ (8.14) 



Equations (5.3) and (8.14) imply 



a = 



n- 2jPj[n] 



In BCD systems, the carried load a' and the offered load a are equal: 

a' = a. (8.16) 

Equations (8.15) and (8.16) give the following relationship between the 
offered load per idle source d = y/n and the total offered load a: 



a = 



(8.17) 



Note that (8.17) could have been anticipated on intuitive grounds: The 
total offered load a is the product of the offered load per idle source a and 
the average number «-2J«. 1 y'i > y [«] of idle sources. 
It can be shown (see Exercise 37) that 

±jPj[n] = a{x + ^l). (8.18) 
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Thus the offered load a per idle source, the total offered load a, the mean 
wait E(W)/n~ x for service in units of mean service time, and the number 
n of sources are related according to the equation 



The offered load a is bounded from above by the intended offered load 
a* — n[a / {\ + a)} (why?), which provides an approximation to a when the 
probability of blocking is small. 

Further insight can be gained from Equation (8.19). Let T be the sojourn 
time (the sum of the waiting time and the service time) of a source. Then 
(8.19) can be written 



where \ = an is the actual rate at which requests for service are made by 
the sources. 

Equation (8.20) is especially useful when this model is used for computer 
performance evaluation. In this application, n terminals (sources) request 
the use of a computer (server) to process transactions. The length of time 
that a terminal takes to generate a request for the computer to process a 
transaction is called the think time; that is, in this application y~ l is the 
mean think time. The length of time from the instant a terminal generates 
a transaction (that is, requests service from the computer) until the com- 
puter completes the transaction (and instantaneously responds by com- 
municating this fact to the user at the terminal) is called the response time; 
hence E(T) is the mean response time. The rate at which transactions are 
processed (which equals the rate at which they are generated) is called the 
throughput; hence \ is the throughput, which is a measure of the computer 
system's processing power. In this application, then, Equation (8.20) relates 
the number of terminals, mean think time, mean response time, and 
throughput to each other. 

For example, suppose that 30 terminals with a mean think time of 20 
seconds are serviced by a single central processing unit (CPU) whose mean 
time to process a transaction is 1 second. A tedious but straightforward 
calculation [using (8.13) or its equivalent, with « = 30, y _1 =20, ju~ 1 = 1, 
and s=l] gives for the mean response time E(T) = E(JY) + fx~ l = 10.3 
seconds. Substituting this value into (8.20), we calculate the throughput 
X = 30/(20+ 10.3) = .99 transactions per second (which, for s=\ server 
with mean service time unity, also equals the utilization of the CPU). If a 
second CPU were added, then s = 2 and the mean response time would 
drop to 1.68 seconds; simultaneously, according to (8.20), the throughput 
would rise to 30/(20+1.68)= 1.38 transactions per second and the utiliza- 
tion of each of the CPUs would drop to 1.38/2 = 69%. 



a 



(8.19) 



n-a[l + nE(W)]' 



n = \(y~ 1 + E(T)), 



(8.20) 
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The derivation of Equations (8.19) and (8.20) outlined here is based on 
the assumption of quasirandom input and exponential service times; the 
derivation of formulas (8.3) and (8.4) for the probabilities [Pj[n]} uses 
these assumptions explicitly. Nevertheless, Equations (8.18) and (8.19) are 
true whether or not these assumptions hold and, in fact, can be shown to 
be true for very general models that describe a finite number of sources 
circulating through a closed network. As we shall discuss in Chapter 5, and 
as is hinted at in Exercise 37, Equation (8.18) can be written immediately 
as a special case of the very general theorem known as Little's theorem. In 
fact, Equation (8.20) can itself be written immediately from this theorem, 
without any other analysis; that argument will be given as an example in 
Section 5.2. (For an application of these results in the context of computer 
performance evaluation, see Chapters 4.1 1 and 4.12 of Kleinrock [1976] and 
Chapter 3.9 of Kobayashi [1978].) 

Exercises 

37. Verify Equation (8.18) by direct calculation. (This result can also be obtained, 
immediately, by application of the Little's theorem L*=\W; see Exercise 28 
and Section 5.2.) 

38. Reconsider Exercise 35, but instead of assumption (2), assume that blocked 
customers are delayed. For n = 4 and n = 5, find also the server occupancy, the 
mean waiting time, the proportion of requests that wait more than 45 seconds 
for service, and the proportion of time that an arbitrary source is idle (neither 
waiting nor in service). 

39. Using Equations (6.3) and (8.17), show that 

alljin] = (n -j)aPj[n] (j = 0, 1 , . . . , n) 

for systems with quasirandom input, exponential service times, and blocked 
customers delayed. Does this equation remain valid for other queue disci- 
plines? 

40. Consider a single-server queueing system with quasirandom input and 
blocked customers delayed, and assume that the service completion process is 
a pure death process with rate = jfi (thus the server speeds up as the queue 
size increases). Find IIj[n] (y'=0, l,... t n— 1). 

3.9. Summary 

In this chapter we have shown that certain queueing models can be treated 
within the framework of the birth-and-death process. In particular, we 
have studied four queueing models, with two types of input (Poisson and 
quasirandom), two different queue disciplines (BCC and BCD), and ex- 
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ponential service times. We have stated that although the assumption of 
exponential service times was used to justify the formulation as a birth- 
and-death process, the results derived for BCC are valid for any service- 
time distribution function with finite mean. We have also derived the 
waiting-time distribution function for the BCD case in which customers 
are served in their order of arrival, and we have outlined the derivation for 
service in random order. Each model was discussed in detail, and ways in 
which the various models relate to each other were pointed out. (It should 
be noted that in a common classification scheme, to be discussed later, the 
j-server Erlang delay model is referred to as M/ M/s.) 

Systems with quasirandom input differ in some basic respects from 
systems with Poisson input (even though every Poisson-input model can be 
obtained, by a limiting process, directly from its quasirandom-input coun- 
terpart). In particular, in contrast to systems with Poisson input, (1) 
finite-source systems can admit only a finite number of states, so that 
questions of convergence do not arise; (2) the magnitude of the offered 
load depends on the state probabilities of the system to which it is offered; 
and (3) the arriving customer's distribution and the outside observer's 
distribution are unequal, but the former agrees with the latter for the 
corresponding system with one less source. 

Finite-source models are discussed in detail by Syski [1960], who covers 
most of the material given here, including a summary of Cohen's paper on 
the generalization of the Engset formula. (As previously mentioned, tabu- 
lations of finite-source formulas are given in Descloux [1962] and Peck and 
Hazelwood [1958].) 

Poisson input models have been more widely studied. Several references 
were given in this chapter, and many others are available. Of particular 
historical interest, Fry [1928, second edition 1965] discusses both Poisson 
and finite-source models in the context of teletraffic theory. 

From a practical viewpoint, the Erlang loss and Erlang delay models are 
the most important, and situations for which they may be realistic models 
should be self-evident. Some interesting papers that are notable for their 
integration of theoretical and practical (numerical) results concerning the 
use of the Erlang loss model and some of its variants in teletraffic studies 
are Benes [1959, 1961], Descloux [1965, 1973], Jagerman [1975], Kuczura 
and Neal [1972], and Neal and Kuczura [1973]. The finite-source models 
are more difficult to understand and calculate but, as noted above, have 
found several important applications. 

Some of the material presented in this chapter is not well known outside 
of the literature of teletraffic theory. In particular, the concepts of offered 
load and carried load, and the relationship between the arriving customer's 
distribution and the outside observer's distribution, have not received the 
attention in the general literature of queueing theory that they seem to 
warrant. 
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Exercises 

41. Queue with feedback. Customers request service on a BCD basis from a group 
of s servers according to a Poisson process with rate X. Upon seizing an idle 
server, a customer holds it for an exponentially distributed length of time with 
mean fi~ l . Upon leaving the server, the customer rejoins the queue (as if he 
were a new arrival) with probability p, and leaves the system with probability 
1— p. Find the statistical-equilibrium probability that a new arrival finds all 
servers busy. 

42. A single server serves customers of two priority classes on a preemptive basis: 
High-priority customers will preempt the server from low-priority customers, 
so that no low-priority customer is ever served when a high-priority customer 
is present. High-priority customers arrive according to a Poisson process at 
rate \ x and have exponential service times with mean jtj -1 . Low-priority 
customers arrive according to a Poisson process at rate \ 2 , and have exponen- 
tial service times with mean jl^ - Assume preemptive-repeat priority discipline 
(a new value of the service time is chosen from the exponential distribution 
each time a preempted customer enters the server). All customers wait as lqng 
as necessary for service. 3 

a. Find the probability that a low-priority customer is preempted from the 
server n times. 

b. Find the distribution function of the total length of time that a low-priority 
customer occupies the server. 

c Find the mean length of time from the instant a low-priority customer first 
enters service until he leaves the system, assuming that when a customer is 
preempted no other low-priority customer can enter service until the 
preempted customer completes his service. 

d. Find the waiting-time distribution function for high-priority customers, 
assuming service in order of arrival within priority classes. 

e. What criteria must be met to ensure bounded delays for customers of each 
class? 

f. Repeat parts a-e assuming preemptive-resume priority discipline (preemp- 
ted customers resume service where they left off)- 

43. Priority reservation. A group of s telephone trunks is offered «j erl of 
eastbound Poisson traffic and a 2 erl of westbound Poisson traffic. Both 
eastbound and westbound call holding times are exponentially distributed 
with common mean. To give the eastbound traffic better service than the 
westbound traffic, any westbound call that finds less than n + 1 (n <s) trunks 
idle is cleared, whereas eastbound requests are cleared only when all s trunks 
are busy. 

a. Write the state equations that determine P p the statistical-equilibrium 
probability that j calls are in the system (J=0, 1,2,...,j). Include all 
boundary conditions. 

b. Solve the equations for Pj (j = Q,\ i 2,...,s). 

c. Write the expressions (in terms of Pj) for the proportions of eastbound and 
westbound calls lost. 
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44. In order to minimize its telephone bill, a company decides to investigate 
leasing a number of flat-rate telephone trunks between two locations. Man- 
agement has decided that nonadministrative personnel will be given access to 
the flat-rate trunks only, while administrators will have access to the regular 
toll network in addition to the flat-rate trunks. Suppose that the situation can 
be described by the following model. 

High-priority calls originate according to a Poisson process at rate A, 
attempts per minute. These calls are first offered to the group of s flat-rate 
trunks. If a high-priority call finds all s flat-rate trunks busy, it will im- 
mediately be offered to the toll network where, it is assumed, there will always 
be at least one idle trunk. Low-priority calls originate according to a Poisson 
process at rate A 2 attempts per minute. These calls are offered to the flat-rate 
trunks only; if a low-priority call finds all flat-rate trunks busy, it waits in a 
queue until a flat-rate trunk becomes available. The holding times for both 
types of call are exponentially distributed with mean length ju, ~ 1 minutes. 

Show that if the company leases the flat-rate trunks at a cost of c dollars 
per trunk per minute, the overall cost per minute c(s) is given by 

c(s) = cs + \ l [r +re-> 1 (l-e-> i y i ]B(s), 

where r is the cost of a toll call for the first minute or any fraction thereof, r 
is the cost of a toll call for each additional minute or any fraction thereof, and 

sB(s,a, + a 2 ) 

B( S ) = (o 2 <s) 

s-a 2 [l-B(s,a t + a 2 )] 

where B (s,a) is the Erlang B formula, and B(s) = 1 when a 2 3*s, where a, = X, / jul and 
a 2 = \ 2 /|jl. Find the mean waiting time for low-priority calls and the occupancy of the 
flat-rate trunks. 



45. Time-varying Poisson input, 

a. Consider a generalization of the Poisson process in which the parameter \ 
can vary with time (time-varying Poisson input); that is, assume that as 
h—*0 the probability of an arrival in any interval (x, x + h) is \(x)h + o(h) 
and the probability of more than one arrival is o(h). Let Pj(t) be the 
probability of exactly j arrivals in (0, /), and show that 

Pji0 -w£y e -«'> o=o,i,...), (i) 

where 

A(0- ('Mx)dx. (2) 
-'o 

b. Consider the transient analysis of the infinite-server queue with (ordinary) 
Poisson input; the probability Pj(t) that exactly j customers are present at 
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any time / is given by Equation (3.11). Show that the number of customers 
present at time t can be interpreted as the number of arrivals in (0,/) 
according to a time-varying Poisson process in which the effective arrival 
rate at every time x (0<x<t) is A(x)=\[l - H(t — x)]. Conclude that (1) 
applies; show that (2) yields A(t)*=\tp(t), where p(t) is given by (3.9); and 
compare with (3.11). 
This exercise was suggested by B/afrge Tilt. 

46. Transient analysis of the single-server Erlang delay model. 

a. Show that if we adopt the time scale in which the mean service time is the 
time unit, then the probability Pj(t) that exactly j customers are present at 
time / satisfies the equations 

-^P (t) = -aP {t) + P x (t) (1) 

and 

^ y (0=<- I (0-(i+^0)+i > J+ ,(0 0=1,2,...), (2) 

where a is the offered load. 

b. In order to solve the equations (1) and (2), consider the auxiliary system of 
equations 

^P j (t) = aP J ^(t)-(l + a)P J (t) + P J+l (t) (y = 0, ±1, ±2,...) (3) 

and 

P Q (t) = aP_M, (4) 

and show that 

PjO) = Pj(t) (7 = 0,1,2,...); (5) 

that is, any solution of (3) and (4) for y = 0, ± 1, ±2,... will also be a 
solution of (1) and (2) for j = y 1,2,... . (The values of Pj(t) for j = 
— 1,-2,... are irrelevant.) 
c Define the generating function 

A*,0- f Pj(t)zJ y (6) 

j--oo 

and show that P(z,t) satisfies the differential equation 

± P(z, t) = [az - (1 + a) + z ~ i}P(z, /), (7) 
the most general solution to which is 

P(z,t)=G(z) exp[ - ( 1 + a) t + (az + z ~ l ) t\, (8) 
where G(z) is an arbitrary function of z. 
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d. Using the fact that the modified Bessel functions I k (y) are given by the 
well-known expansion 



exp[^ + ^" ! )]= 2 AGO**. 



(9) 



fc— — OO 



show that 



k" — oo 



(10) 



<?(*)- 2 <^ 

7- -oo 



e. Now assume that G(z) can be represented by^an expansion of the form 

(11) 

and show, by equating coefficients in (6) and (10), that 

4(f) -e"* 1 *-* 2 c k a^ + ^I J+k (2a^ 2 t) (12) 



*- -oo 



(where the coefficients {c k } remain to be determined), 
f. Show that Equation (9) implies that 7 (0)= 1 and 4(0) = when k=^0; and 
therefore, if the number of customers present at t—0 is /, then c,™ 1 and 
c k =0 when k < and kj=—i. Thus, Equation (12) can be written 



-(l + a)t 



^•-'•>/,„ ( .(2a'/ 2 0+ 2 c k a^ +k >I J+k (2a l /h) 

k- 1 



(13) 



g. Show that Equations (13) and (4) imply 

«"*'/-,+ 2 d k I k = a-r+h_ (i+l) + a 1 / 2 2 44-i, (14) 

k-\ k-l 

where we have written d k = c k a kjf2 and I k — I k {2a l ^ 2 t). 

h. Using the fact that I k = I- k [as is easily verified by replacing x by x~ x in 
both sides of (9)], show that (14) implies 

4 = (k =1,2,...,/), 

_ 1 — A 

a i+l = a 2' 2, 
4 +1+BI -a-i«'-J m -^(l-fl) (m=l,2,...). 
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i. Conclude that 



+ 0-«) 2 a- fc/2 /, +I+ *( 2 a 1/2 

Jt«2 



(15) 



j. For what values of the offered load a is Equation (15) valid? 
This derivation of (15) is that of Cox and Smith [1961]. For further discussion, 
generalizations, and references, see also Cohen [1969], Riordan [1962], Syski 
[1960], and Takacs [1962a]. 
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Multidimensional Birth-and-Death 
Queueing Models 



In this chapter we shall discuss methods of analysis of queueing models for 
which the natural definition of a state requires more than one variable. Our 
approach will be through the use of examples that illustrate various 
techniques and often represent systems important in themselves. 

4.1. Introduction 

In Section 2.2 we studied the one-dimensional birth-and-death process, the 
essential characteristic of which is that the system occupies "states" E , 
E v ... and the rates at which changes of state occur depend only on the 
instantaneous state of the system and not on the past history of the process 
(the Markov property). In Section 2.3 we showed that, under appropriate 
conditions often assumed to hold in practice, the state probabilities {Pj} 
are related to each other according to the equations (3.11) of Chapter 2, 
which we restate for convenience: 

(\ + h) p j = h-i p j- 1 + 1* + a = o, i, . . . x (i.i) 

where {A,} and { jl^} are the rates of transition upward and downward, 
respectively, and X _ l = (j^ = 0. 

We noted that the equations (1.1) have the following simple and im- 
portant interpretation: For each state, the rate at which the system leaves 
the state equals the rate at which the system enters the state. In other 
words, these equations express a law of conservation of flow, that is, rate 
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out — rate in. The equations (1.1) are easily solved by recurrence [the 
normalized solution is given by the equations (3.10) of Chapter 2]. 

In this chapter we consider systems that have the Markov property, but 
whose states require more than one variable for definition. For example, 
consider the infinite-server queue with Poisson input at rate A and two 
types of customers, where the service times of customers of type i (i— 1,2) 
are independent, identically distributed, exponential random variables with 
mean duration t,-. If t, = t 2 = t, then the equilibrium probabilities Pj 
(J = 0, 1,...) that there are j customers in the system satisfy the equations 
(1.1) with Xf^X and iij—jr~ l . However, if r l =£? 2 > one cannot specify the 
death rate without knowledge of the composition of the state E r But if 
we define the two-dimensional states {Ej j 2 }> where j t is the number of 
customers of type i in the system, with corresponding equilibrium distribu- 
tion {P(J\Jt)}> then clearly the Markov property still holds, and the death 
rate corresponding to the state Ej ^ is 

/*OW2)"^i T r I+ >2 T 2~ l - 

Note that the one-dimensional equations (1.1) are not incorrect in this 
case; they are simply inapplicable because the death rates cannot be 
specified. 

In what follows, we consider systems that can be described by a 
multidimensional analogue of the simple birth-and-death equations (1.1). 
More precisely, we assume that systems that have the Markov property 
when the states are properly defined have equilibrium state distributions 
that satisfy "rate out = rate in" equations analogous to (1.1). Unlike the 
equation for the one-dimensional case, however, the analogous set of 
multidimensional difference equations cannot, in general, be solved by 
recurrence. We shall devote this chapter to a discussion of queueing 
models, many of practical importance, that can be formulated as multidi- 
mensional birth-and-death processes, and to special techniques for solving 
the equations of conservation of flow that are assumed to describe these 
systems in equilibrium. 

As an example of a multidimensional birth-and-death queueing model, 
we consider a finite-source system with nonidentical sources. In particular, 
imagine a single-server system with two sources. Assume that source /' 
generates requests at constant rate y, when idle and rate otherwise, and 
has exponential service times with mean Assume that the queue 

discipline is blocked customers delayed. Let 5, = if source * is idle, 5, = 1 
if source i is being served, and S t =2 if source i is waiting for service. 
Finally, let P{S x =j, S 2 = k} = P(j,k) CM = 0,1,2) be the statistical- 
equilibrium state distribution. Now, the reader should be convinced that 
the conservation-of-flow equations, which equate the rate the system leaves 
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each state to the rate at which it enters that state, are 



(Yi + Y 2 m0,0) 
(Y 2 + MiW>0) 
(^ + ^(0,1) 



y 1 /^0,0) + ft P(2, 1), 
y 2 P(0,0) + t i 1 P(l,2), 

y 2 p(i,o), 



Ml P(l,2) 
^(2,1) 



Observe that the sum of the terms on the left-hand side is identical to the 
sum of the terms on the right-hand side; thus any one of these equations is 
redundant and can be ignored. The four remaining equations, together 
with the normalization equation 



uniquely determine the unknown probabilities. 
Exercise 

1. Finite-source systems with nonidentical sources. We proved in Section 3.6 that in 
any equilibrium n-source (one-dimensional) birth-and-death queueing system 
with quasirandom input (identical sources), the state distribution seen by a 
source when placing a request is the same as would be seen by an outside 
observer of the corresponding (n — l)-source system. This suggests that for 
(multidimensional) birth-and-death systems with nonidentical sources, the state 
distribution seen by a particular source when placing a request is the same as 
that particular source would see if he placed no requests, but instead acted as 
an outside observer of the system. Descloux [1967] has observed that this 
generalization cannot always be made, and has given the necessary and 
sufficient conditions for its validity. To illustrate this, consider a system 
composed of a single server and two nonidentical sources, as above, but, unless 
stated otherwise, make no assumption about the queue discipline, 
a. Show that the probability b 2 that source 2 is blocked (finds the server 
occupied when placing a request for service) is given by 



P(0,0) + P(1,0) + P(0, 1) + P(l,2) + P(2, 1) = 1, 



P(0,0)+/»(1,0)' 



b. Show that if source 2 makes no requests for service but instead acts the part 
of an outside observer, then the probability b' 2 that source 2 sees the server 
occupied is given by 
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c Assume that blocked customers are cleared. Find the distribution {P(J>k)} 

and show that b 2 = b' 2 . 
d. Assume that blocked customers are delayed. Find the distribution {P(J,k)} 

and show that 

b = (Yi/feXfa + Vi + Vi) 

2 Mi + (Ml/P2)Yl + r2 + (Yl/M2)(M2 + Yl + Y2) ' 

Show that b 2 = b' 2 M and only if p x = p. 2 . 
This exercise was suggested by P. J. Burke. 

4.2. Product Solutions (Mixed Traffic, Queues in Tandem) 

Infinite-Server Group with Two Types of Customers 

Consider an infinite-server group that receives two distinct types of Pois- 
son traffic (such as ordinary telephone conversations and data) with arrival 
rates \ t and \ 2 . Assume that customers of type 1 have exponentially 
distributed service times with mean /a, -1 , that customers of type 2 have 
exponentially distributed service times with mean p^ 1 , and that each value 
of a service time is obtained from its corresponding distribution indepen- 
dently of all others and the arrival process. Let P(J\Ji) De *h e statistical- 
equilibrium joint probability that at any instant there are j x customers of 
type 1 and j 2 customers of type 2 in service. Then, equating rate out to rate 
in for each state, the statistical-equilibrium state equations are 

(^ + \ 2 +j t /*! +j 2 ii 2 )P{j l J 2 ) 

=X l P(Ji ~ 1 J 2 ) + VOV2- 

+ (Ji + l)f»i *C/i + W2) + O2 + 1 WO'i Ji + 1) 
[P(-lJJ-PUi.-l)-0; j\ = 0,\,...; y 2 =0,l,...]. (2.1) 

In addition, of course, the probabilities must satisfy the normalization 
equation 

2 2 /•OVJ-l. (2-2) 

In this case, we already know the answer. Since the server group is 
infinite, the two types of customers do not affect one another. Thus the 
marginal distribution of the number of customers of each type is that 
which would be obtained by solving the corresponding one-dimensional 
problem, namely the Poisson distribution: 

PiU) = 2 P(m = £^ e~<M»\ (2.3) 

k = J' 

P 2 (J)= 2 P(kJ)=^^e~^\ (2.4) 
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Since the number of customers present of each type is independent of the 
number present of the other type, therefore P(j r'jj 2 ) — ?\U ^PiUt)'^ that is, 

P{Ji j 2 ) = (VM!)* (Mj^ e - I <x l/l . 1 )+<V^]. (2.5) 
Ji' Ji' 

Let us substitute the solution (2.5) into the equations (2.1). We see 
immediately that, as far as (2.1) is concerned, the factor 



exp 



^1 + ^2 

Mi fh. 



is irrelevant, since it divides out. More importantly, each factor on the left 
of (2.1) corresponds to a single factor on the right and vice versa. 
Specifically, it is easy to verify that (2.5) satisfies the equations 

= Ux + OMi PUi + IJi), 

VOW2M./2 + l)M2^0'lJ2+ 1), 

Ji Mi^O'i J 2 ) - *i HJi ~ 1 Jz)> ( 2 - 6 ) 
h P2PU1J2) = ~ !)- 

[Observe that the first and third equations of the set (2.6) are identical, as 
are the second and fourth.] Thus we have shown that, in this case, the 
conservation-of-flow equation (2.1) can be decomposed into the four 
separate conservation-of-flow equations (2.6). However, there seems to be 
no intuitive reason to predict that this decomposition should be possible. 
(For the reader interested in further theoretical considerations, we mention 
that this decomposition property is related to Kolmogorov' s criterion for 
reversibility — see Keilson [1965, 1979] and Kelly [1979] for discussion and 
references.) The important point to be made here is that, as we shall 
demonstrate, the decomposition property holds for a very large variety of 
models that do not appear similar at first glance; and as a practical matter, 
this property of decomposition implies that a "simple" solution of the basic 
conservation-of-flow equations exists. 

For example, the first equation of the set (2.6) shows that 

njvJi) n — p y^Jih 

Ji- 

similarly, it follows from the second equation of (2.6) that 
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Thus we see that the decomposition implies (and is implied by) the 
statement that the solution PU1J2) ls a product of two factors, one 
depending functionally on only the index j v the other on only j 2 : 



PUxJi^PxU^PiUil (2-7) 
or equivalently, in this case, 

p(jvJi) = — n n — c > < 2 - 8 ) 

Jl- J2- 



where the constant c is determined from the normalization equation (2.2). 

In practice, a good strategy is to assume a product solution of the form 
(2.8) (or some other form) and see if it satisfies the original conservation- 
of-flow equations. If it does, then the solution has been obtained, except 
for a multiplicative constant required to make the probabilities sum to 
unity. (That this solution is unique can be argued on physical grounds.) If 
it doesn't, not much effort has been expended, and a different approach 
can be undertaken. (The reader may recognize this strategy as the method 
of separation of variables, widely used in solving partial differential equa- 
tions.) The extension to models with more than two variables should be 
clear. The question of exactly what form of product solution should be 
tried is not easily answered, except to say that the examples of the 
following sections should be helpful. 

Finally, with regard to the particular model used as an example here, we 
remark that although we have assumed that the service times for each type 
of customer are exponentially distributed, we know from previous consid- 
erations that the marginal distributions are of the form (2.3) and (2.4) for 
any distribution of service times with means juj -1 and /x 2 -1 . It is probably 
true that in any system in which customers of a certain type are cleared 
when blocked, or when no blocking can occur, the results are independent 
of the form of the service-time distribution function for that type. (See 
Fortet and Grandjean [1963].) 

Finite-Server Group with Two Types of Customers and Blocked 
Customers Cleared 

As in the previous example, we assume two types of customers, with 
customers of type i characterized by parameters \ and /a,; but the number 
of servers is now s < 00, and blocked customers are cleared. The system is 
again described by (2.1), which is valid now only for ji+j 2 <s. When 
j\+j2 — s, then the states E Ji + l j 2 and Ej t j +i cannot occur. Therefore, 
again equating the rate of flow out of state Ej j to the rate of flow into 
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E { ; , we have 

Ji'Ji' 

Observe that (2.9) can be obtained from (2.1) by deleting the first two 
terms on the left and the last two terms on the right. It follows that the 
product solution (2.8) also satisfies (2.9), since each term deleted (remain- 
ing) on the left corresponds to a term deleted (remaining) on the right. 

Thus, in this case the boundary conditions do not change the form of 
the solution, which is again given by Equation (2.8). The probability P(J) 
that there are j customers in service is given by PU) = 2j i+ j 2 „jP(j\J 2 ), 
which, with the help of the binomial theorem, reduces to 




(2.10) 



The normalization equation, 2*«oA^) = l> implies that 




We conclude that 

EL 

P (>)=-r-T 0-0,1,...,.*), (2.11) 



where a = (k x / n l ) + (X 2 / fi^. This result agrees, of course, with that ob- 
tained by different reasoning in Section 3.3: When two streams of Poisson 
traffic of magnitudes a x erl and a 2 erl are simultaneously served by a group 
of s servers on a BCC basis, each stream sees the state distribution, given 
by the Erlang loss (truncated Poisson) probabilities with the parameter 
a ss a l + a 2 . 



Exercises 

2. a. Three cities, A, B, and C, are interconnected by two trunk groups, group 1 
consisting of s { trunks connecting A and B, and group 2 consisting of s 2 
trunks between B and C (Figure 4.1). A call between A and C simulta- 
neously holds a trunk in each group. Blocked calls are cleared. Write and 
solve the equilibrium state equations for this network. 
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B 




Figure 4.1. 



b. Consider the network of part a augmented by the addition of group of s 
trunks directly connecting A and C. In the augmented network, all calls 
between A and C are carried on the new group if a trunk is available. Calls 
between A and C that find all s trunks in the new group busy are 
alternate-routed via groups 1 and 2 if there is an idle trunk in each group, 
and are cleared otherwise. Whenever a direct trunk between A and C 
becomes available and there is at least one overflow call in progress, an 
overflow call will be switched down from the alternate route to the direct 
route, thus using one less trunk to complete the call. Write and solve the 
equilibrium state equations for this network. 

c Consider the same network as in part b, but without the capability of 
switching alternate-routed calls down to the direct route. Is this system 
easier or harder to analyze than that of part b? Why? 

d. Suppose now that calls offered to any of the three direct groups can be 
alternate-routed via the other two, and that any alternate-routed call can be 
switched back to its direct route. Write and solve the equilibrium state 
equations for this network. 

3. A group of s trunks serves two types of Poisson traffic on a BCC basis. Type 1 
is ordinary telephone traffic, with arrival rate \ x and exponential holding times 
with mean /if 1 . Type 2 is "wideband" data traffic, with arrival rate \ 2 
exponential holding times with mean /if 1 - Each wideband call requires the 
simultaneous use of k trunks for transmission. If a wideband call finds less 
than k idle trunks, it is cleared from the system. Find the proportion of 
ordinary calls and the proportion of wideband calls that are lost. 

4. A group of s servers handles n types of customers. A customer of type / is 
cleared from the system if on arrival he finds all servers busy or k t <s other 
customers of his own type in service (/'= l,2,...,/i). Customers of type /" arrive 
in an independent Poisson stream at rate \, and have exponential service times 
with mean ju, -1 . Find the probability of blocking seen by customers of each 
type. 

5. A group of s servers handles two types of customers on a BCC basis. Type 1 is 
Poisson traffic with arrival rate A. Type 2 is quasirandom traffic generated by n 
sources, each bidding with rate y when idle. Service times are exponential with 
means /if 1 and /if l , respectively. 
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a. Find the equilibrium probability PJJxJj) that there are y, type-1 customers 
and j 2 of the n type-2 customers simultaneously in service at an arbitrary 
instant. 

b. Define 

r#OW 2 ;0« iim p{^ t (o-y„ iv 2 (o=y 2 |c,(M+/o}, 

where JV}(0 is the number of customers of type / that are in service at time t 
and C,(t,t + h) is the event {a customer of type /' makes a request for service 
in (t,t + h)}; and let 

mOWa)= lim n i „(jj 2 ;t). 

t-*ac 

Show that 
and 

[Therefore, the probability of blocking suffered by customers of type 1 is 
2j l+ j 2 - s P„( j\,j2), and that suffered by customers of type 2 is 

6. Calls arrive according to a Poisson process with rate X at a call distributor that 
routes each call to one of two trunk groups, composed of s x and s 2 trunks, 
respectively. Let n x >s x and n 2 >s 2 be integers. If an arrival occurs when j f calls 
are in progress on group /' (7=1,2), then with probability (n i —j i )/(n 1 + n 2 — 
ji —j 2 ) the new arrival is routed to group /. A call routed to group / that finds 
all s t trunks busy is cleared from the system. Call holding times are exponential 
with mean p~ l , and X//i = a. If P(J\Ji) * s me equilibrium probability that 
there are j x calls on group 1 and j 2 calls on group 2, show that (see Appendix D 
of Buchner and Neal [1971]) 



n l + n 2 \ (j x +j 2 )\ 



[Ji+Ji J 

iow might one surmise a solution of this form? 



Customers arrive according to a Poisson process with rate X at a group of s 
xponential heterogeneous servers; that is, the servers are numbered 1,2,...,j, 
nd the time required for server / to serve a customer is exponentially 
distributed with mean fij~ l . All of the servers that are idle when a customer 
arrives are equally likely to be selected by the customer, and blocked customers 
are cleared. Define the random variables X t = when the ith server is idle and 
Xj= 1 when the ith server is busy (/= 1,2,. ..,s), and let P(x h ...,x s ) — P{X l = 
x l> ...,X s = x s } be the equilibrium state probabilities (x r = 0, 1 for each / = 
1,2,. ..,5). 
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a. Show, with no calculation, that if /tj 



• • = /*r = ju, then 



P(X U ... >Xs)=(j) Pj 



when Xj + • • - +x s =j, 



and 



2 P(x x ,...,x s ) = Pj = 0,1, ...,*), 



where {Pj} is the Erlang loss distribution (3.3) of Chapter 3. 
b. Write the statistical-equilibrium state equations that determine the distribu- 
tion {P(x l ,...,x s )} in the general case (the service rates are no longer 
assumed to be equal), and show that the solution is 



Verify that this solution can be specialized to yield the results of part a. 
Queues in Tandem 

Consider two sets of servers arranged in tandem, so that the output 
(customers completing service) from the first set of servers is the input to 
the second set. Assume that the arrival process at the first stage of this 
tandem queueing system is Poisson with rate X, the service times in the first 
stage are exponential with mean /if \ and the queue discipline is blocked 
customers delayed. The customers completing service in the first stage 
enter the second stage, where the service times are assumed to be exponen- 
tial with mean /if 1 . Customers leaving the first stage who find all servers 
occupied in the second stage wait in a queue in the second stage until they 
are served. The number of servers in stage * is s t . 

Let PU1J2) be tne statistical-equilibrium probability that there are j x 
customers in stage 1 and j 2 customers in stage 2. To save rewriting the state 
equations for each set of boundary conditions, let 



Then the statistical-equilibrium state equations, obtained by equating the 




where 





(2.12) 
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rate the system leaves each state to the rate it enters that state, are 

[X+/iiO'i) + /*aC/2)]^C/W2) 

= XPO\ ~ 1 J 2 ) + MiC/i + 1 Wi + iJ 2 - 

+ ^0 2 +i)POv 2 +i) 

[P(-l,j 2 ) = P(j i ,-l) = 0; 7,-0,1,...; y 2 = 0,l,...]. (2.13) 

The term ii x (J l + \)P(J x -¥\,j 1 -\) reflects the fact that a departure from 
stage 1 constitutes an arrival at stage 2. 

In the previous two examples, it has turned out that the joint probability 
PU1J2) could be written as the product of two factors P x {j\) an d P2U2) '• 

WM-PiUdPiUJ. (2-14) 

In the example under consideration, we can evaluate without calculation 
the marginal distribution of the number of customers in the first stage; for, 
the first stage of this tandem queueing system is precisely an Erlang delay 
system, so that the marginal distribution of the number of customers in 
stage 1 is given by the Erlang delay probabilities (4.3), (4.4), and (4.5) of 
Chapter 3. 

Therefore, because of our previous success, and because it is certainly 
worth a try, let us assume a product solution of the form (2.14), with the 
factor .PjOi) given by the Erlang delay probabilities: 



A0.) = 



ci— — Oi — r l Ji — 1), 



(2.15) 



We shall substitute the assumed product solution (2.14) and (2.15) into the 
equilibrium state equations (2.13), with the hope that we will be left with a 
one-dimensional set of equations that we can solve for the remaining 
factor P2U2) °f our assumed product solution. [If our guess turns out to be 
correct, then, since (2.14) holds for all 7, and j 2 , it follows that the factors 
on the right-hand side of (2.14) are, in fact, the marginal probabilities for 
each stage.] 

It is readily verified that substitution of the assumed product solution 
(2.14) and (2.15) into the state equations (2.13) yields, after cancellation, 

[A + fhUi) ] P2U2) - A^U ~ 1) + H2O2 + 1)^2(72 + 1) 



[P 2 (-1) = 0; y 2 = 0,l,...]. 



(2.16) 
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Inspection of the equations (2.16) shows that these are precisely the 
equilibrium state equations that, except for the normalization condition, 
define the Erlang delay probabilities. We conclude that P2U2) is given by 



P2U2)- 



c 2 — — 2 = 0,1,...,j 2 -1), 

J 2' 



(2.17) 



Since 



2W)-1 0-1.2), (2.18) 

j = 



it follows that if we define a ( = \/ ft, then 



c '=L?„^ + ^kT^a)' (i -'- 2) - (2 - 19) 



This equation implies, as expected, that a proper joint distribution exists 
only when X/ jUj <s x and A/ jn 2 <.s 2 . 

Note that, as in the previous cases, each term on the left-hand side of 
Equation (2.13) corresponds to a single term on the right-hand side of 
(2.13) and vice versa. Specifically, we have the correspondences 

^PUxJi) = M2C/2 + 1 Wi J2 + !)> 

as can be easily verified by substitution. We have thus derived three 
conservation-of-flow equations that hold in this particular problem. How- 
ever, there was no intuitive reason to predict that these conservation laws 
would hold individually, but only that they would hold jointly [which, of 
course, is the basis for Equation (2.13)]. This must be viewed as an 
enlightening and unexpected result. 

We have shown that the equilibrium joint probability P(J\Ji) factors 
into a product of two Erlang delay probabilities, P x (j\) an d PiUi)- We 
conclude that the number of customers in either stage of this tandem 
queue is independent of the number of customers simultaneously present 
in the other stage. 
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We have also shown that the marginal distribution of the number of 
customers in the second stage is the Erlang delay distribution; that is, the 
distribution of the number of customers in the second stage is exactly that 
which would obtain if the first stage were not present and the customers 
arrived according to a Poisson process directly at the second stage. 

These remarkable results, which were first obtained by R. R. P. Jackson 
[1954, 1956], suggest that the output process might, in fact, be the same as 
the input process to the first stage, that is, a Poisson process. The truth of 
this conjecture was proved by Burke [1956]; Burke's theorem states that 
the sequence of departures from an Erlang delay system in equilibrium 
follows a Poisson process and, further, the state of this Erlang delay system 
at any arbitrary time t is independent of the departure process previous to 
t . (These results were also obtained by Reich [1957].) It easily follows 
from Burke's theorem that the analysis given here for the case of two 
queues in tandem can be extended to the case of an arbitrary number of 
queues in tandem. A proof of Burke's theorem is outlined in Exercise 2 of 
Chapter 5. 

As one might guess, there are many other remarkable facts about 
tandem queueing systems and departure processes to be uncovered. For 
example, in single-server tandem queues with service in order of arrival, 
the sojourn times (sojourn time is waiting time plus service time) of an 
individual customer in the various stages of the tandem queueing systems 
are independent random variables; however, the waiting times are not 
independent. Surveys of results on departure processes and tandem queues 
are given by Burke [1972] and Daley [1976]. 

The model of queues in tandem generalizes naturally into that of 
networks of queues (see Exercises 8 and 9). This is an area of active 
research, with important applications in telecommunications and computer 
science. (The entire September 1978 issue of Computing Surveys, Vol. 10, 
No. 3, pp. 219-362, is entitled "Queueing Network Models of Computer 
System Performance.") Some recent works on networks of queues are 
Barbour [1976]; Baskett, Chandy, Muntz, and Palacios [1975]; Beutler and 
Melamed [1978]; Beutler, Melamed, and Ziegler [1977]; Boxma [1979a,b]; 
Chandy, Howard, and Towsley [1977]; Courtois [1977]; Disney [1975]; 
Kelly [1976, 1979]; Lemoine [1977, 1978, 1979]; Melamed [1979a,b]; 
Mitrani [1979]; Noetzel [1979]; Pittel [1979]; Schassberger [1978a,b]; 
Simon and Foley [1979]; Stoyan [1977]; and Vantilborgh [1978]. Addi- 
tional references are given in Exercises 8 and 9. 

Exercises 

8. Networks of queues. 

a. Consider a network of queues, say Q i7 Q 2 , Q$, and Q A , with input-output 
structure as in Figure 4.2, where each departing customer from Q l is 
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Figure 4.2 



assigned, independently, to either Q 2 or Q$> with probabilities p l2 and p u 
(= 1 — p l2 ), respectively; and similarly for departures from Q 2 . Assume also 
that the input to Q y is Poisson, with rate A; that Q t is served by s, servers, 
with exponential service times with mean nf 1 ; and that all customers wait 
as long as necessary for service. Find the equilibrium distribution of the 
number of customers present at each queue, 
b. Consider the same network as in part a, except that now the portion of the 
output from Q 2 that does not go to Q 4 feeds back as input to Q x instead of 
leaving the system. Write and solve the equations that determine the 
equilibrium distribution of the number of customers present at each queue. 
(Networks of this type, with feedback, were first considered by J. R. 
Jackson [1957], who observed that, as in part a, the results are the same as 
one would obtain if the input to each queue were a Poisson process. 
Jackson remarked that this was "far from surprising in view of recent 
papers by P. J. Burke and E. Reich." However, as Burke [1972] points out, 
Jackson's results are, in fact, quite astonishing, because the combined input 
to any queue that receives feedback is not Poisson: 

The point is that in networks without feedback not only are the states of the 
component queues distributed as if the inputs are Poisson, these actually are 
Poisson; whereas when feedback is present the only statement that can be made 
is that these distributions are the same as those which would arise if the inputs 
were in fact Poisson. 

For example, the results of Exercise 41 of Chapter 3 can be obtained by 
assuming (incorrectly) that Burke's theorem applies and therefore the 
combined stream of new and fed-back arrivals is a Poisson stream. Burke 
[1976] proves that this is not the case; nevertheless, the system behaves as if 
it were the case. Recently, Beutler and Melamed [1978] and Melamed 
[1979a] have shown that Burke's theorem can be generalized to apply to the 
exit arcs of Jackson-type networks with single-server nodes. For additional 
related results see Disney, McNickle, and Simon [1980], and the references 
cited therein; see also Disney, Farrell, and de Morais [1973], and Foley 
[1979].) 



9. Closed networks of queues. 

a. Consider a network of m queues, Qi,Q2>->Q m > tne structure as in 
Figure 4.3. Assume that Q ( is served by s t servers, with exponential service 
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Qm 



Figure 43 



times with mean ft/" 1 ; that all customers wait in each queue as long as 
necessary for service; and that the number n of customers in the network 
is fixed. (Thus, the n customers circulate continually among the m stages of 
the network.) Let P(J x ,... t j^ be the equilibrium joint probability that the 
number N t of customers in stage * (waiting or in service) is j, , (i = 1, 2, . . . , m), 
and show that 



n aa) 



i-\ 



s n &(/■,) 

S i-l 



(1) 



where S denotes all combinations of the nonnegative integers ri,...,r m for 
which r x + • • • + r m = n, and 



QiUi)= 



0)<Si), 



(j) > s i)- 



Are the random variables Ni,...,N m independent? 
b. The model of part a is a specialization of one considered by Gordon and 
Newell [1967], in which each departure from stage / decides, independently, 
to enter stage j with probability py (ij= l,2,...,m). Convince yourself that 
the conservation-of-flow equations that determine the equilibrium joint 
distribution PUi>---J m ) °f me number of customers present in each stage 
of Gordon and NewelFs model are 



2 ftC//Wi,».JJ- 2 2 PkUk+i)PkiP(Ji,'--Ji-h---Jk+h...j m ) 

i-l i = \ *»1 



+ 2 PiUl)PuP(Jw-Jm)> 
i-l 



(2) 



where ju,(y) is defined as in (2.12), and, of course, 2F(y'i,...J m )= 1. Show 
that if one again assumes a solution of the form (1), with QjO)) now 
defined as 



QiUd=- 



OA)* 



Ui<3i), 



Ui>S t ), 



(3) 
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then it follows from substitution of the assumed solution (1) and (3) into 
the equilibrium state equations (2) that 



2 PtUi) 



i- 1 



\ 2 Mfc X Pki 



=0. (4) 



Observe that if (4) is to hold for every y, (0 < y, <n), the term in brackets 
must vanish for each i; that is, 

m 

2 />«(***- | )-ft*r 1 = U,...,m). (5) 
k-i 

It can be shown (see, for example, p. 153 of Cinlar [1975]) that if the matrix 
[Py] is that of an irreducible aperiodic Markov chain, then the vector 
(\i x Xx X ,... f n m x~ x ) is uniquely determined to within a multiplicative fac- 
tor; furthermore, the components of this solution vector are all strictly 
positive. Conclude, therefore, that the assumed solution (1) and (3) is 
indeed the correct solution of Equation (2), provided only that the finite 
Markov chain defined by the matrix of transition probabilities [p {J ] is 
aperiodic and irreducible (which is not a severe restriction on the kinds of 
transitions allowed among stages of the network). 

Gordon and Newell's model subsumes as special cases the closed network 
models of earlier authors, such as Koenigsberg [1958] and Finch [1959]. 
Gordon and Newell show also that the open models of Exercise 8 ("open" in 
the sense that customers can enter and leave the system) and the closed 
models of this exercise are equivalent. For a discussion of numerical methods 
for computing the equilibrium distributions that describe networks of this 
type, see Buzen [1973]. 



10. The following model can be used to study talking and dialing equipment 
interactions in a modern private branch exchange telephone switching system. 
Calls arrive at random to two groups of servers, H 2 (time slots for talking) and Hi 
(digit trunks for dialing). If an arrival finds all servers in H 2 busy, the call is 
cleared from the system. If an arrival finds at least one idle server in each of H 2 
and H lt the call holds one server in each group simultaneously for an exponen- 
tially distributed time interval with mean u,f 1 (dialing time); it then releases the 
server in H t but retains the server in H 2 for an additional exponentially distributed 
time with mean u^" 1 (conversation length). If an arrival finds all servers in Hj 
busy and at least one idle server in H 2 , the call holds a server (waits) in H 2 until it 
gains access to an idle server in H } ; it then proceeds in the same manner as a call 
that initially finds an idle server in both H 2 and Hj. Write and solve an appropriate 
set of equilibrium state equations for this system. 

11. The following is a simplified version of a model that has been used to estimate 
the number of fire engines needed in New York City (Chaiken [1971]). 
Suppose that in a region of the city fire alarms occur according to a Poisson 
process with rate X. With probability pi an alarm will request the dispatch of a 
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single fire engine, which will be sufficient to handle the fire. In this case, the 
length of time the fire engine spends away from the firehouse is exponentially 
distributed with mean t(1). With probability p 2 an alarm will again request the 
dispatch of only a single fire engine, but this will prove inadequate and a 
second fire engine will be dispatched after an exponentially distributed time 
interval with mean t,(2). The two fire engines will then work together for an 
exponential time with mean t 2 (2), after which one of the fire engines returns 
to the firehouse and the other remains at the fire for an additional exponential 
"cleanup" time with mean t 3 (2). Finally, with probability p 3 an alarm will 
request the dispatch of two fire engines simultaneously. In this case the two 
fire engines will work together for an exponential time interval with mean 
Ti(3), after which one of the fire engines remains for an additional exponential 
cleanup time with mean t 2 (3). Thus there are three types of fires, each 
composed of a number of independent exponential phases — the type-1 fire, 
which occurs with probability p h has one phase with mean t(1); the type-2 
fire has three phases with means ti(2), t 2 (2), and t 3 (2); and the type- 3 fire has 
two phases with means tj(3) and t 2 (3). The total number of fire engines in the 
region is n. Any fire engines needed beyond these n will be provided from 
outside the region. Let P(iJ\jT.Ji,k x ,kd be the equilibrium probability that 
at an arbitrary instant there are simultaneously i fires of type l,y r fires of type 
2 in phase v {v =1,1,3), and k v fires of type 3 in phase v (p — 1,2); and let 
P m - S/P(i;>i J 2 Ji; K k 2 ), where / = { iJiJ 2 Ji,k x ,k 2 : / +j , + 2j 2 +> 3 + 2k x + k 2 
= m). Then i* = 2~„„P m is the probability that there will not be any idle fire 
engines in the region when an alarm occurs. Thus, calculation of 
P(i J\,j 2 ,h;k x ,k 2 ) allows one to estimate the number of fire engines required 
to keep P<e for any specified e. Show that 



P{i\j\JiJz,k x ,k 2 ) = 



[liMl)]' IpfrxOti* [*2XT 2 (2)r* [pfr&rf* 
»'! j\\ 7*2* h } - 

[ jP3 Xt 1 (3)]*' [ P3 \r 2 (3)] k > 



X 



k x \ k 2 \ 

Xexp{-[/ Jl XT(l)+^ 2 XT 1 (2)+/> 2 \T 2 (2)+^ 2 AT 3 (2) 
+/>3^ 1 (3)+/> 3 \t 2 (3)]}. 

(Chaiken also shows that, as one might expect, the same result holds when the 
phase lengths are not exponentially distributed.) 



43. Generating Functions (Overflow Traffic) 

As with state equations in one variable, generating functions are often 
useful in solving multidimensional state equations. It is interesting that the 
generating-function technique may be applicable even when some of the 
variables take on only finitely many distinct values. 

As an example of the use of generating functions for solving such a 
problem, we consider the case of a queueing system composed of an 
i-server primary group and an infinite-server overflow group. In this 
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Infinite-server 
overflow 
group 



s-server 
primary 
group 

T 

Offered load a = X//i erl 
Figure 4.4 

so-called simple overflow model, it is assumed that customers request 
service first from the ^-server primary group, and that all customers who 
find all servers busy in the primary group overflow to and are handled by 
the infinite-server overflow group. 

The system to be studied is represented by Figure 4.4. Let P{M=j, 
N= k} = P(J,k) be the statistical-equilibrium probability that M—j and 
N—k customers are simultaneously in service on the primary and overflow 
groups, respectively. The input is Poisson with rate X, and the service times 
are exponential with mean ju -1 . 

Following Kosten [1937], we shall obtain the generating function of the 
distribution {P(J,k)}, from which the following important formulas for the 
mean E(N) = a and variance V(N) = v are easily found: 

a = aB(s,a) (3.1) 

and 

v = a(l-<x+— ), (3.2) 

V s+l + a — a / 

where a = X/ /i is the offered load in erlangs and B(s,a) is the Erlang loss 
formula. 

These results form the basis for an important approximation technique, 
called the equivalent random method, which we shall discuss in Section 4.7. 
The derivation given here is instructive, but it is not the most elegant route 
leading to (3.1) (which is intuitively obvious) and (3.2). (See Brockmeyer 
[1954], Riordan [1956, 1962], Wallstrom [1966], and Descloux [1970]. The 
properties of overflow streams are discussed in Palm [1943] and 
Khintchine [1969]. Although (3.1) and (3.2) follow easily from Kosten's 
analysis, which we give here, according to Riordan they were first given 
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explicitly in independent unpublished memoranda (1941) by H. Nyquist 
and E. C. Molina.) Also, despite the elementary nature of these calcula- 
tions, they are involved. If the reader is willing to accept (3.1) and (3.2), he 
can skip the calculations without loss of continuity. 

The equations for the probabilities {PQ',k)} when j<s— 1 are 

(a +j + k)P(j, k) = aP{j -l,k) + (j+\)P(j+\,k) 
+ (k + l)P(j,k+l) 
[P(-hk)=0; y=0,l,...,j-l; £ = 0,1,...], (3.3) 

where tf = A/>. When an arriving customer finds all s servers in the 
primary group busy, he takes a server in the overflow group. Thus, for 
j = s, 

(a + s + k)P(s, k) = aP{s — l,k) + aP(s, k-\) 
+ (k+l)P(s,k+l) 
[P(s,-1) = 0; £ = 0,1,...]. (3.4) 

The system is completely described by Equations (3.3), (3.4), and 

2 I W,k) = l. (3.5) 
j=o k=0 

To solve these equations using generating functions, consider the aux- 
iliary system of equations obtained from (3.3) by ignoring the finiteness 
of s: 

(a +j + k)P(j,k) = aP(j -l,k) + (j+ \)P(j +l,k) 
+ (k + l)P(j,k+l) 
[P(-l,k) = 0; 7 = 0,1,...; *-0,l,...]. (3.6) 

We shall solve the set (3.6) in such a way that 

P{j,k) = P{j,k) (j = 0,l,...,s; £ = 0,1,...). (3.7) 

The values {P(J,k)} for j>s yielded by the equations (3.6) are of no 
consequence. (Observe that this is the same strategy used in Exercise 46 of 
Chapter 3.) 

Define the two-dimensional probability-generating function 

«/W)= I 2 PU,k)xJy k . (3.8) 
Substitution of equations (3.6) into (3.8) yields, after some straightforward 
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manipulation, the partial differential equation 



(1 - x)-^faj) + (1 -y) ^ fa,y) = «(1 - x)fa,y), (3.9) 



whose solution is 



(3.10) 



where G is an arbitrary function. (For discussions of partial differential 
equations of the form (3.9) see, for example, Garabedian [1964].) 

We now determine the function G such that the coefficients {P(J,k)} 
generated by \[/(x,y), which satisfy the state equations (3.3), also satisfy the 
boundary condition implied by Equation (3.4). Then Equation (3.7) will be 
satisfied. To this end, we first expand G in powers of (1 — y)/(\ — x). Then, 
from (3.10), 



JcoO-e-*'-"! <v(jz7)". (3 ") 



where the coefficients {c y } remain to be determined. 

Let us expand the term e~ a(1_Jc) (l — x)~" in Equation (3.1 1) into a power 
series in x : 



6 --<»-*>(i -*)-'- 2*0V. 

7-0 



where, as is easily verified, 



(3.12) 



<f>,0)= 



7T* 



i%\ I I {.1-1 



J-i 



('-0), 
(*-U,...). 



(3.13) 



Two useful identities are 



2 *wC/)-*w+i(») 
j-o 



(3.14) 



and 



O'+lKC/ + 0-^C/) + ^ + iC/). (3-15) 
To obtain Equation (3.14), we simply equate coefficients on both sides of 
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the identity 

e-«*-*\l- x )-' = e-« l - x \l-x)- lw+l) -e-« l -*\l-x)- iw+1) x. 
The result is 

<t>„0) = <f>,+ it/) -<fv+ i(J- 0> 

from which Equation (3.14) follows directly by summing both sides of this 
equation. Similarly, Equation (3.15) follows after we equate coefficients on 
both sides of the identity 

^e-d-^l-^-'-oe-^-^l-^-' + w— (1 -*>(l-jc)- (r+1) . 

Now consider Equation (3.7). When j — s, comparison of Equations (3.4) 
and (3.6) shows that if (3.7) is true, then 

aP(s,k-l) = (s+l)P(s+l,k) (A:=l,2,...) (3.16) 

and P(s+1,0) = 0. 
Let $j(y) be the marginal generating function 

i(y)= 2 Hj,k)y\ (3.17) 

so that 

*W)- 2 (3.18) 
j-o 

Then, introducing (3.16) into (3.17), we obtain 

wP,(y)'*(3+i)h + i(y). (3.19) 
From Equation (3.18) we see that 

*^)-^(t7^W)) (3-20) 

-/• \ ox J J x =0 

Using (3.12) in (3.11), we have 

*(*,y) = 2 <v0 2 fcOV- (3.2i) 
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It follows from Equations (3.20) and (3.21) that 



«O0- 2 cMM-y)'. (3-22) 

p-0 



In particular, 



l(y)= 2 c&wi-yy. (3.23) 

If we use the identity 
we can write 

AM- 2 cMW-yy- 2 c^WO-jO'* 1 . (3.24) 

Therefore, equating coefficients of (1— y) v in Equation (3.19) we have, by 
virtue of Equations (3.23) and (3.24), for v~ 1,2,..., 

ac p <f> p (s) - ac v _ = (s+ l)c^ + !)• (3-25) 

Comparison of Equation (3.25) with (3.15) when y' = s shows that 

^+lW"- flC r-A-l( J ) (*=1,2,...) 

or, solving recursively, 

^ = (-l)-7T — fg x, / x g o (f=l,2,...). (3.26) 

It remains to find c . 

Applying the binomial theorem to the factor (1—yY in Equation (3.21), 
we can write 

2 c v i (-i)*( 2 ^ov, 

where we define 

and which, after interchanging the orders of summation over v and k, 
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becomes 



k*,y)= 2 

k-0 



y k 2 fcOV- 

7-0 



Finally, interchanging the orders of summation over v and y, we have 



2 2 

fc=o y-o 



(3.27) 



Comparison of Equations (3.8) and (3.27) shows that 

P(M)=(-i)*2 (J)'AC/). 

f = A: 

which, in view of Equation (3.26), can be written 

v\a" <t>i(s)4> (s) 



*M)-(M)<2(-^)^f^,0K. 0.28) 



v-k 



We now determine c from Equation (3.7) and the normalization equa- 
tion (3,5). Summing first with respect to the index k, we have 

which, after interchange of the order of summation, becomes 

S f <-»)'f / !i M T *W 2 (-!)*([)■ 0.29) 

The last summation on the right-hand side of (3.29) is the binomial 
expansion of (1 — 1)*"; therefore 



2 (-i)*(J)-o {v>\). 

Since by definition ( q) ™ 1» Equation (3.29) becomes 



2 P{j,k) = c<& Q {j). 

A-0 



(3.30) 
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Summing both sides of Equation (3.30) with respect to j, we have 

2 2 P(J,k) = c Q t<t>o(J). (3-31) 

7 = k-0 j~0 

Using the normalization equation (3.5), it follows that 

1 

c o=— 

2 UJ) 

and finally, using the identity (3.14), we conclude that 

<V=-W- (3.32) 

Thus, in view of Equation (3.28), we have Kosten's explicit expression for 
the distribution { P(J, k) } : 



p(j,k)=(-\) k us)^(-\y(l)^ 



= 0,1,. ..,5; * = 0,1,...), (3.33) 

where {<£„(/')} * s given by Equation (3.13). 

Note that, according to Equations (3.30), (3.32), and (3.13), the marginal 
distribution of the number of busy servers in the primary group is, of 
course, the Erlang loss distribution 



f po^-xrr-'-r-r 0'-o,i,...,5). (3.34) 

We are now prepared to derive the required formulas (3.1) and (3.2) for 
the mean E(N) = a and the variance V(N) = v of the overflow distribution. 
It follows from Equation (3.17) that 



and therefore 



-AOt (3 - 35) 
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From Equation (3.22) we see that 

which, in view of (3.26) and (3.32), becomes 

Using (3.36) in (3.35) we have, in light of the identity (3.14), 

The required result (3.1), which of course could have been anticipated on 
other grounds, follows from (3.37). 
To calculate v = V(N\ note that 

j-o\dy 2 

Since V(N) — E(N 2 ) — E 2 (N), we have 

2 [4i*aA =^ + « 2 "«- (3-38) 
From Equation (3.22), we see that 



($H. - 



2c 2 <t> 2 (j) 0-0,1,. ..,5). (3.39) 



Therefore 



v + a 2 -a = 2c 2 , 2 <f> 2 (y). (3.40) 
J-o 

Using the identity (3.14), we have 

v + a 2 -a=2c 2 <f> 3 (s). (3.41) 

Equation (3.2) now follows directly from (3.41). 

The important concept to be digested here is in the use of probability- 
generating functions to solve multidimensional state equations which, at 
first glance, do not seem amenable to these techniques because of the 
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existence of certain boundary conditions. The key steps are the extension 
of the original state equations so that the generating-function technique 
can be used, the expansion of the generating function in a power series, 
and the calculation of the unknown coefficients by forcing the generating 
function to agree with the previously ignored boundary conditions. 

It is interesting to speculate whether the preceding results are valid 
independently of the form of the service-time distribution function. In 
particular, is the formula (3.2) for the variance V(N)=v of the overflow 
distribution valid in general for systems with Poisson input? One might 
suspect so, since both the distribution of the number M of customers on 
the primary group and the distribution of the total number M+N of 
customers in the system are known to be independent of the form of the 
service-time distribution function. Nevertheless, as Burke [1971] has de- 
monstrated in a study of the simple overflow model with constant service 
times, this is not the case. Of course, (3.1) remains true, since it gives the 
mean number of customers per service time who find all primary servers 
busy. 

Overflow models are of both theoretical and practical interest, and have 
received much attention throughout the past 50 years. Syski [1960] 
summarizes most of this work, including practical applications. Riordan 
[1962] and Khintchine [1969] also discuss and summarize various aspects 
of overflow problems. Important papers not discussed or summarized in 
these books include Burke [1971], Descloux [1963, 1970], Neal [1971], and 
Wallstrom [1966]. (See also Section 4.7 and Exercise 12.) 

Exercise 

12. Apportioning the moments of the overflow distribution. Suppose that customers 
arrive in n independent Poisson streams with rates \ x ,...,\ at a single 
primary group of servers and, if all servers in the primary group are busy, are 
served on a single infinite-server overflow group. Let jw ~ 1 be the mean service 
time, and set a, =\//i {i = 1,2, ...,«). Let N t be the number of customers from 
the ith Poisson stream present on the overflow group at an arbitrary instant in 
equilibrium. Then, with a = a } H + a n and N~N x -\ + JV„, 

E(N ( )=^E(N), (1) 

F(JV '> = ( 7T I' V{ N) + « ( 1 ~ a ) E(N) ' (2) 
Cov< Ai, Nj) = ( ^ )( |)[ V(N) - E(N)]. (3) 

For simplicity, we shall prove the results (l)-(3) for the case n=2 only. 

a. Assume exponential service times. Let M be the number of customers in 
service on the primary group, and define h(J,ki f k^) = P {M=j, N x = k u 
N z = k 2 }^ Write the equilibrium state equations for the distribution 
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{h(J t k lf k^)}, and show that the solution is 

HM„* 2 )=m<)(^r(ff, 

where P(J>k) is given by Equation (3.33). [Hint: Substitute the above 
expression for h(J,ki,k 2 ) into the state equations, and show that Equations 
(3.3) and (3.4) result.] 
b. Show that 

c Show that 

E(N i \N=k) = k~, (5) 

E(N*\N- *)-*(*- 1)( ^ )* + *^ . (6) 

d. Show that (5) implies (1). 

e. Show that (6) implies 

e ( n 'H^) 2e(n2 ^){ 1 -^) e(n) - (7) 

f. Show that (7) implies (2). Define the peakedness factor for any stream as its 
overflow-distribution variance-to-mean ratio, and show that Equation (2) 
can be written 

z-\= Pi (z-\), (8) 

where />, is the proportion of customers in stream i, and z ( and z are the 
relevant peakedness factors. 

g. Continue the argument of parts b-f to prove (3). 

h. Conclude that, for exponential service times, the moments of the overflow 
distribution corresponding to the /th Poisson stream (at least for n=2) are 
given by Equations (1), (2), and (3) with E(N) = a given by (3.1) and 
V(N) = v given by (3.2). 

These results, which are useful in the application of the equivalent random 
method, were originally obtained by A. Descloux [1962, unpublished] and 
Lotze [1964]. The following argument shows that the assumption of exponen- 
tial service times is not necessary for the validity of Equation (4), and 
therefore the results (l)-(3) are valid for arbitrary service- time distribution 
function: Because the original streams of traffic are Poisson, each sees the 
same probability of blocking on the primary group. Hence, given that a 
customer is blocked, the probability that mis customer is from stream i 
is the ratio a,/ a. Further, this probability is independent of the composition 
of the other customers in service on the overflow group. Therefore, given the 
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number of simultaneous customers in service on the overflow group, the 
distribution of the number of customers of type / is the binomial with 
proportion Oj/a, given by Equation (4). It is interesting that the formulas 
(l)-(3) are valid for an arbitrary service-time distribution function, whereas 
the moment V(N) [but not E(N)] depends on the distribution of service times. 
[If the service times are exponentially distributed, then V(N) = v is given 
by Equation (3.2).] 

4.4. Macrostates (Priority Reservation) 

In all the examples so far, the naming of the states of the system has been 
quite natural in the context of the problem, and our attention has been 
directed toward methods of solution of the corresponding equilibrium state 
equations. In this section we observe that it is sometimes possible and 
convenient to consider groupings of the "natural" states (microstates) into 
macrostates. 

For example, consider a system in which a group of s servers handles 
two independent Poisson streams of traffic, where customers in stream 1 
are designated high-priority customers, and those in stream 2 are desig- 
nated low-priority customers. A high-priority customer will be cleared 
from the system if on arrival he finds no idle servers, whereas a low-prior- 
ity customer will be cleared from the system if on arrival he finds less than 
n + 1 idle servers. Service times for both high- and low-priority customers 
are assumed to be exponentially distributed with the same mean. We wish 
to find the probability of blocking experienced by the customers of each 
priority class. 

One approach to this problem is to describe the system by the two- 
dimensional states E Jij2 , where j x and j 2 are, respectively, the numbers of 
high-priority and low-priority customers in service; and then write and 
solve the "rate out = rate in" state equations for the corresponding distribu- 
tion {PUxJi))- If we define 

Pj= 2 PU1J2) 0=o,i„. ..,5), (4.i) 

then P s is the probability of blocking seen by the high-priority customers, 
and L S j_ s _ n Pj is the probability of blocking suffered by the low-priority 
customers. The indicated calculations are tedious and, as the following 
illustrates, unnecessary. 

Consider now the grouping of all those two-dimensional microstates 
E ivj with the property that j\+j 2 =j into a single one-dimensional macro- 
state Ej. That is, we define the macrostate E } as the set of all microstates 
Ej j that correspond to a total of j customers in service. It should now be 
clear that the probability corresponding to the state Ej is P p and that the 
distribution {Pj} is easily found from the one-dimensional "rate up — rate 
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down" state equations (where \ l and A 2 are the arrival rates and n 1 is the 
mean service time): 

(*! + \ 2 )Pj_ l =JnPj 0=1,2,...,*- n), (4.2) 

K p j- i -Jl&j (j = s~n+l,s-n + 2,...,s), (4.3) 

The solution, found easily by recurrence, is 
(a j + a 2 Y 



p J = 



P Q (j=\,2,...,s-n), 

p o (j=s-n + \,s-n+2,...,s), 



( fll +g 2 y-"W (J ~" ) 

Jl 



(4.4) 



where a^Xj/fx is the offered load, in erlangs, from the /th stream, and 
where, from the normalization requirement, 



\Jfc-0 K - k = s-n+\ K ' } 



It is worth pointing out that the example considered here is identical to 
Exercise 43 of Chapter 3. A reader with no knowledge of multidimensional 
birth-and-death processes would not make the mistake of doing this 
problem the hard way. Of course, if the mean service times of the 
high-priority and low-priority customers were unequal, the one-dimen- 
sional macrostate approach would not apply, and the advantage would lie 
with the more advanced reader. 

In the above example, the fact that a macrostate approach is possible is 
almost obvious, and once the proposed macrostates are identified, the 
validity of the results is clear. This is not always the case. When a valid set 
of equilibrium macrostate equations exists, the macrostate equations can 
always be derived from the ordinary equilibrium microstate equations. In 
general, however, it is arguable whether it is easier to write down directly 
an appropriate set of macrostate equations, or to derive them (if indeed 
they exist) from the more detailed but sometimes conceptually simpler 
ordinary microstate equations. 

As an illustration of these difficulties, we consider a model that is a 
generalization of the model considered above. Imagine a priority reserva- 
tion system with s identical exponential servers, where customers arrive in 
s independent Poisson streams, S X ,S 2 ,...,S S . Customers in stream S, may 
start service only when / — 1 or fewer servers are busy. All customers wait 
as long as necessary to obtain service. (If we denote by A,- the arrival rate 
of the customers in stream S if then this model reduces to the j-server 
Erlang delay model when A, = A 2 = • • • =A y , = 0.) 
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Let Ej. nxi n be the (micro-) state corresponding to the event {j servers 
are busy and n ( customers from stream S { are waiting, /= 1,2,... J, j — 
1,2,..., s), and let P(j;n 1 ,...,n J ) be the corresponding equilibrium proba- 
bility. It is tedious but straightforward to write the conservation-of-flow 
equations that determine these probabilities; however, it is quite another 
matter to solve them. 

Now consider the macrostates Ej O' = 0, l,...,,s), corresponding to the 
events {j servers are busy}, and let {P(J)} be the corresponding 
equilibrium distribution. Clearly, 

P(J) = S P(j;n l ,...,n J ), 

n u ...,nj 

and P(J) + P(J + 1)H + P(s) is the blocking probability for stream Sj. 

We now argue that an appropriate application of the principle of 
conservation of flow leads to the following equations for the macrostate 
equilibrium probabilities: 

(A,+ ... +\)P(j-l) + X,[PU)+--' +P(s)]=j l iP(j) = 1,2,. ..,,). 

(4.6) 

Equation (4.6) and the normalization equation, 

j = 

permit successive calculations of the distribution {P(J)} whenever the 
offered loads a v ...,a s {a t =\/ p) are such that statistical equilibrium 
obtains. 

To see the truth of (4.6), observe that the term on the right-hand side 
equals the equilibrium rate of transition from Ej to Ej_ x , including those 
transitions that are instantaneously reversed because a waiting customer 
seizes a just-released server. On the left-hand side of (4.6), the term 

Q^-\ +\)P(J— 1) equals the equilibrium rate of transition from Ej_ x to 

Ep including only those transitions that are not involved in instantaneous 
reversal; the term \j[P(J) + • * • + P(s)] equals the equilibrium rate of 
transition from Ej_ x to E p including only those transitions that instanta- 
neously reverse previous transitions from Ej to Ej_ v (The quantity \j[P(J) 

H + P(s)] equals the rate at which arrivals from 5^. join the queue. 

Eventually, each such customer will seize a server whose release caused the 
transition Ej-*E J _ l ; the just-released server will be seized instantaneously 
by a waiting customer from Sj, thereby causing the transition E J _ 1 -±E J .) 
Hence, the left-hand side of (4.6) equals the equilibrium rate of transition 
from E J _ 1 to E p and the right-hand side equals the equilibrium rate of 
transition from Ej to E J _ l ; that is, rate up equals rate down. 
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A. Descloux [1969, unpublished] has rigorously derived (4.6) directly 
from the conservation-of-flow equations that determine the microstate 
probabilities {P(J;n l ,... t n J )}. Descloux's complicated analysis further 
yields the conditions that must be satisfied if statistical equilibrium is to 
obtain (as we have tacitly assumed). Interestingly, the equihbrium condi- 
tions turn out to be polynomial functions of the offered loads, this 
nonlinearity being a reflection of the interaction among the different 
streams of traffic. For instance, when j = 3 the equilibrium condition for 
stream S x is 

6a l +3a 2 + 2a 3 + a l a 3 — 6<0. 

Exercise 

13. Show that when a x — a 2 — • • ■ = q,_ i = and a s ~ a, then Equation (4.6) yields 

P(s) = C(s,a), 
where C(s,a) is the Erlang delay formula. 

4.5. Indirect Solution of Equations (Load Carried by Each Server 
of an Ordered Group) 

We have seen that in many cases of practical interest, the probability state 
equations are easy to specify but difficult to solve in closed form. Such 
problems usually call for simulation (which we shall discuss in a subse- 
quent chapter) or numerical methods (which we shall discuss in the next 
section), but occasionally a closed-form solution can be obtained by 
indirect means. 

We consider again the question of systems with ordered hunt as de- 
scribed previously in Section 3.3: The servers are numbered 1,2,...,,?, and 
each arriving customer takes the lowest-numbered idle server. As in 
Chapter 3, we assume that customers arrive according to a Poisson process. 
However, we replace the assumption that blocked customers are cleared by 
the assumption that blocked customers may wait in the queue and defect 
from it at any arbitrary rate; this necessitates (if the problem is not to be 
intractable) the further assumption of exponential service times. For this 
model we calculate, as before, the load carried by the y'th ordered server 
(j = 1,2, . . . ,s), which is the same as the equilibrium probability that the jth 
ordered server is busy. 

As a first step, let us review the results of Section 3.3 for the Erlang loss 
model with ordered hunt. We defined the random variables Xj (J — 
1,2,...,.?), with ^. = when the yth ordered server is idle and 1 when 
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the yth ordered server is busy. Then, defining pj — P{Xj = 1}, we showed 
[Equation (3.18) of Chapter 3] that 

Pj-a[B(j-ha)-B(J,a)], (5.1) 

where a is the offered load and B(k,a) is the Erlang loss formula. Equation 
(5.1) was interpreted to say that the load pj carried by the yth ordered 
server in an Erlang loss system is the difference between the load that 
overflows the (J— l)th ordered server and that which overflows the yth 
ordered server. 

This model was extended in Section 3.4 to include the case where all 
blocked customers wait until served. There we presented a clever intuitive 
argument, first advanced by Vaulot [1925], that shows that the equilibrium 
conditional probability P{Xj=\\N<s) that the jih ordered server is busy, 
given that there are no customers waiting, is given, remarkably, by [see 
Equation (4.14) of Chapter 3] 

P{Xj=l\N<s}=Pj = 1,2,. ..,j), (5.2) 

where p p given by (5.1), is the corresponding probability for the system in 
which no waiting is allowed. This result, combined with Equation (4.13) of 
Chapter 3, leads to the required result, namely, 

Pj = a[BU-ha)-B(j,a)]P{N< S }+P{N>s}, (5.3) 

which we used in Exercise 19 of Chapter 3 to calculate the most economi- 
cal division between flat-rate and measured-rate trunks in an Erlang delay 
system. 

Our present objective is to give a more algebraic proof of (5.2) that is of 
some interest in itself. For the sake of generality, we no longer restrict 
ourselves to the assumption that blocked customers wait until served, but 
instead we allow any queue discipline that can be described in the 
framework of a multidimensional birth-and-death process. (That Vaulofs 
argument remains valid for any birth-and-death model with A,=A and 
liy=jn (j-0,l,...,s) was observed by P. J. Burke [1963, unpublished].) To 
this end, we again begin by considering the Erlang loss system with 
ordered hunt, and we define 

P(^,...,x J ) = J P{Jir 1 = ^ 1 ,...,^ = ^} (5.4) 

and 



(5.5) 
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Then, using the principle of conservation of flow, we can write, for all 
Xj = or 1 except x x = * • • = x s = 1, 

+ +x s )n]P(x 1 ,...,x s ) 

s 

= ^2 SO*»*i+" - +X J )P(x l ,...,Xj_ lt Xj-l,Xj+ l ,... t X s ) 

J-1 

s 

+ /*2 8(0,x J )P(x l ,...,x j _ l ,x j +l,Xj +i ,...,x s ); (5.6) 
and when x x = • • • = x 5 = 1 , 

5 

jj^P(l,..., 1)=A 2 ' ' • +Xj)P(x l ,...,Xj^ l ,Xj-\,x J+l ,...,x s ). 

(5.7) 

The 2* unknown probabihties {P(jci,...,jc 4 )} can be found from the set 
(5.6) of 2 s — 1 independent equations and the normalization equation 

2 JW", *,)-!, (5.8) 

where the summation is carried out over all 2 s probabilities. [Equation (5.7) 
can be obtained by adding all the equations in (5.6), and is therefore 
redundant.] 

Our objective is to find p p the load carried by the y'th ordered server, 
which is given by 

Pj = 2 H*\> • • ->Xj_ „ \,x J+ !, . ..x s ), (5.9) 

where the sum is taken over all values of the arguments x x , 
,,, i Xj_ l ,x J+l ,.,.,x s . Observe, however, that it is not necessary to solve the 
equations (5.6) and evaluate the right-hand side of (5.9) in order to 
calculate py, we have already rigorously shown, in Chapter 3, that pj is 
given by (5.1). We now use this observation to prove that, as asserted, 
Equation (5.2) is valid for the more general model that allows queueing, 
thereby proving (5.3). 

To this end, let Q be the number of customers waiting in the queue, and 
define 

P(x v ...,x s ;k) = P{X 1 = x v ...,X s = x s , Q = k). (5.10) 
Now observe that, for k = and not all x t = 1, the probabilities defined by 
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(5.10) satisfy the same conservation-of-flow equations (5.6) that are satis- 
fied by the probabilities (5.4) for the Erlang loss system with ordered hunt. 
When k = and all x i ; = 1, then the corresponding flow equation is 

(A + j/i)P(l,...,l;0) 

s 

=A 2 8(J»Xi +,mm +Xj)P(x l ,... t Xj_ l ,x J -\ i Xj +l ,...,x t ;0) 

+ /t, +1 />(l,...,l;l). (5.11) 

It is also true (because "rate up = rate down") that 

Ai»(l,...,l;*)- ft+it+1 i^l,...,l;fc + l) (A:=0,1,...). (5-12) 

(Note that this model allows the departure rate ^ to be arbitrary when 
j>s. The Erlang delay model is the special case in which fy = sn when 
j>s.) Substitution of (5.12) with k=*0 into (5.11) yields an equation 
identical with (5.7). Thus we have shown that the probabilities 
{P(x l ,...,x s )} and {P(x l ,...,x a ;0)} satisfy the same equations (for all xj). 
From this important observation we can conclude that the probabilities 
{P} and {P} are proportional (by c, say) when & = 0: 

P(x v ...,x s ;0) = cP(x v . . . , x s ). (5. 13) 

Now, applying the definition of conditional probability, we can write 

, P{Xj=hN<s} 
ft^-HAfO}- '; {N<s} ' . (5.14) 

But 

P{Xj-l,N<s}-^Iix v ...,Xj_ lt hxj +l ,... t x; t O) (5.15) 

and 

P{N<s} = ^ l P(x i ,...,x s ;0). (5.16) 
Substitution of (5.13) into (5.15) and (5.16) yields 

P{X J = \ y N<s} = c^ d P(x v ... i x j _ l Xx J+l ,... i x s ) (5.17) 

and 

P{N<s} = c'2P(x 1 ,...,x s ). (5.18) 
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Comparison of (5.17) with (5.9) shows that 



P{X J =\,N<s} = cp J ; 



(5.19) 



and comparison of (5.18) and (5.8) shows that 



P{N<s} = c. 



(5.20) 



Substitution of (5.19) and (5.20) into (5.14) yields (5.2), and the proof is 
complete. 

Note that nowhere did we solve any complicated set of conservation-of- 
flow equations. The problem was solved by recognizing that the set of 
equations whose solution we required had essentially the same solution as 
another set, namely (5.6) and (5.7); and, further, the required information 
could be extracted from (5.6) and (5.7) without solving these either, but 
rather by applying an elegant argument [that of Section 3.3 to obtain (5.1)] 
that is valid only for the original Erlang loss model. For an analysis of a 
similar model with heterogeneous exponential servers (exponential servers 
with different rates) see Cooper [1976]. 

The moral of this section is that there is more than one way to skin a cat. 



14. Consider again the premise of Exercise 7 (heterogeneous exponential servers 
selected in random order), but assume now that all blocked customers wait as 
long as necessary for service. Let Q be the number of customers waiting in the 
queue; define P(x i ,...,x/,k) = P{X l — x i ,... t Xs = x Jt Q = k) and p=X/(/i, 
+ • • • + /i,); and show that if p< 1, then 



These results were first obtained, in somewhat different form, by Gumbel 



Exercise 



P(x u . . x s ; 0) « cP(x x , . . ., Xj) 



and 



P(l,...,Uk) = cp k P(l,...,l), 



where P(x i,..., x,) is given in part b of Exercise 7 and 




[I960]. 
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4.6. Numerical Solution of State Equations by Iteration 
(Gauss-Seidel and Overrelaxation Methods) 

Thus far we have seen that queueing problems often lead to sets of 
multidimensional probability state equations. In each of the cases consid- 
ered so far in this chapter, we have been able to solve these equations 
without recourse to numerical methods. In most cases of practical interest, 
however, product solutions, generating functions, or astute observations 
will not lead to neat solutions. Fortunately, equations of the type we are 
concerned with often yield easily to a simple numerical iteration scheme. 
Convergence criteria are known, but are often difficult to apply in practi- 
cal cases. However, real queueing problems in which this iteration tech- 
nique fails rarely, if ever, occur. Because of its simplicity, its particular 
suitability for digital-computer application, its apparent reliability, and the 
lack of anything better, we shall briefly describe an iteration procedure 
and its generalization and illustrate their application to the solution of 
multidimensional probability state equations. 

Numerical analysis is a subject in itself. The present discussion is 
intended to provide only a starting point for one who is interested in the 
theory of numerical solution of large systems of linear equations. Hope- 
fully, the following necessarily terse discussion will enable the reader to 
"get numbers" from multidimensional probability state equations. It is 
worth mentioning that the iteration procedure we shall describe is much 
simpler to apply than might appear from the mathematical description. 
The numerical example that follows the theoretical description should 
make this clear. 

Consider a general set of linear equations 



where A is a given square matrix, b is a given vector, and x is an unknown 
vector. [In the case of interest here, the components of x are the 
equilibrium state probabilities that are to be found; (6.1) is the set of 
equilibrium state equations.] Equation (6.1) can be rewritten in the form 



where L and U are, respectively, lower and upper triangular matrices with 
zeros along the main diagonal, and I is the unit matrix, with ones along the 
main diagonal and zeros everywhere else. Equation (6.2) can be written 



Ax=b, 



(6.1) 



(I-L-U)x=d, 



(6.2) 



x = Lx + Ux + d, 



(6.3) 
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which suggests the iteration scheme 

x ( n + i) = Lx ( n +i) + Ux (» ) + dj ( 6A y 

where the vector x (w) is the «th iterate, n = 0, 1,..., and x (0) is an arbitrary 
vector. Equation (6.4) simply says calculate each component of the vector 
x (n+1) based on the latest calculated values available, which are all the 
components of the vector x (n + ^calculated thus far and the remaining 
components of the vector x w calculated at the last stage of iteration. The 
iteration procedure (6.4) has associated with it the names Seidel, Gauss, 
Liebmann, Nekrasov, and probably several others. The general technique 
of solving equations by iteration is also called successive approximation and 
relaxation. The most common name of the scheme (6.4) is Gauss -Seidel 
iteration. 

It can be shown that a necessary and sufficient condition for the 
convergence of the iteration procedure (6.4) for arbitrary x (0) is that all the 
eigenvalues of the iteration matrix M = (I-L) _1 U lie inside the unit circle; 
that is, the iteration procedure (6.4) converges independently of the initial 
vector x (0) if and only if the spectral radius of the iteration matrix is less 
than unity. 

This convergence criterion is ordinarily of theoretical interest only, since 
finding the spectral radius of the iteration matrix is usually of the same 
order of difficulty as solving the original equations (6.1). It is desirable to 
be able to predict behavior of the iteration scheme directly from inspection 
of the original matrix A. This is possible in certain special cases. In 
particular, a sufficient condition for convergence of the iteration scheme is 
that the original matrix A be irreducible and exhibit weak diagonal domi- 
nance; that is, if a iy is the element in row / and column j of the irreducible 
matrix A, then convergence is assured if 

M> I W O-U,...,*), 

7=1 

and for at least one / 

k 

Wu\> 2 Kl> 

where k is the number of rows (columns) of the matrix A. This condition is 
not usually met by probability state equations, but it does suggest why this 
iteration scheme has proven useful in this context. The greater the con- 
centration of "mass" of the matrix A along its main diagonal, the more 
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likely (and faster) will the procedure converge. State equations of the type 
we are concerned with are characterized by a large number of zero 
elements off the main diagonal, making convergence likely, and simulta- 
neously making computer programming particularly easy. 

A generalization of the Gauss-Seidel method that has proven useful is 
the so-called method of successive overrelaxation, in which the overrelaxa- 
tion values are obtained by weighting the most recent value by a factor 
w> 1: 

X (" +l) = co[Lx (n + I) + Ux ( ' ,) + d] +(1 -<o)x (n > (w > 1). (6.5) 

The Gauss-Seidel method is thus the special case of overrelaxation for 
«=1. An intuitive feel for the reason overrelaxation sometimes speeds 
convergence can best be gained by running through a numerical example. 
Mathematically, the use of a weighting factor co will speed convergence if 
the corresponding iteration matrix has a smaller spectral radius. 

Experience indicates that overrelaxation with w?wl.3 is often faster than 
simple Gauss-Seidel iteration. Many fundamental questions about conver- 
gence and rates of convergence remain unanswered. The interested reader 
should see Young [1971]. Recently, Brandwajn [1979] has proposed other 
iterative methods that may be superior (with respect to assurance of 
convergence and rate of convergence) to the Gauss-Seidel and overrelaxa- 
tion methods. Related numerical methods are discussed in Buzen [1973] 
and Wallace [1973]. 

From a practical point of view, it appears that the Gauss-Seidel, overre- 
laxation, and similar iterative methods (such as those proposed by Brand- 
wajn) are the most useful methods for numerical solution of large sets of 
equations of the type that arises from analysis of multidimensional birth- 
and-death processes. These methods are especially suitable for implemen- 
tation on a digital computer. (Also, a point that was of utmost importance 
only a short time ago, when computers were relatively rare and expensive, 
is that iteration schemes that converge for any starting vector x (0) are 
self -correcting; an error is "absorbed" by the method and results in only a 
larger number of iterations. Anyone who has tried to solve systems of 
equations on a calculator will surely appreciate this fact.) 

We now give a simple illustration of the use of the Gauss-Seidel and 
overrelaxation methods in the solution of multidimensional statistical- 
equilibrium probability state equations. 

Consider two server groups, each of which receives direct Poisson traffic 
and also overflow traffic from the other group (see Figure 4.5). Let the 
server groups G x and G 2 be of sizes s x and s 2 , with Poisson loads of 
magnitudes a x erl and a 2 erl, and exponential service times with mean ft -1 . 
Customers who find all s l + s 2 servers busy are cleared from the system. 
Let PU1J2) oe th e statistical-equilibrium probability that there are j x 
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a 2 



I t 



G 2 



S2 servers 



I t 



Sj servers 



I t 



Figure 4.5 



customers in service on G x and j 2 on G 2 . Then, equating rates of flow out 
of and into each state, the statistical-equilibrium probability state equa- 
tions are 

- a x P{j x - \J 2 ) + a 2 P(j x J 2 - 1) 

+ Ui + 1 Wi + 1 J2HO2 + 1)^0W2+ 

0'i<*i> ji<Si), (6-6) 

(tf, + d 2 + S x +j 2 )P{s x j 2 ) 

= a x P(s x - l,j 2 ) + a l P(s 1 J 2 - 1) 

+ a 2 P{s x J 2 - 1) + (J 2 + \)P(s l J 2 + 1) 

t/i"*i» h <s 2 )* (6-7) 

(a x + a 2 +j x + s 2 )P(j x ,s 2 ) 

= <* x P(j x - \,s 2 ) + a 2 P(j v s 2 - 1) 

+ a 2 P(j x - l,s 2 ) + 0', + \)P(Ji + U 2 ) 

(Ji<Si> h = s 2 \ (6-8) 
(s x + s 2 )P(s x ,s 2 ) = (fli + a 2 ) p (si ~ + ( a i + a 2 )P(s x ,s 2 - 1) 

Ol=-S]» y2 = ' s 2)- (6.9) 

The normalization equation is 



2 £ rUM-i- 
A-0J2-0 



(6.10) 
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Table 4.1 
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We shall solve these equations for the particular case s x = 4, a x - 5, s 2 = 2, 
a 2 = 1. The total number of states is (s x + 1)(j 2 + 1)= 15. Consider the state 
equations (6.6)-(6.9). For reasons that will be explained presently, we 
reorder these equations so that their matrix of coefficients is as shown in 
Table 4.1. All the entries on the main diagonal are positive, and all those 
off the main diagonal are negative or zero. Note how the nonzero elements 
cluster along the main diagonal. 

Observe that this 15 X 15 matrix is of rank 14; that is, each column sums 
to zero and any one of the rows can be derived from the other 14. In other 
words, one of the equations is redundant, and the other 14 equations 
determine the solution only up to a constant factor. This factor is supplied 
by the normalization condition (6.10). 

Although one of the equations is redundant, it appears that the conver- 
gence of the iteration procedure sometimes is accelerated by using all the 
state equations, as we shall do in this example. The normalization equation 
may be used after each complete round of iteration, as we do in this 
example, or it may be used only after the last stage of iteration. When all 
the state equations are used, normalization is accomplished by summing 
the unnormalized values and dividing each by their sum. Normalizing only 
after the last stage of iteration is more efficient as long as computer 
overflows and/ or underflows do not occur, since the relative values are 
unaffected by normalization. (We normalize after each round in this 
example because we want to display the values at each step.) 

We now apply the method of successive overrelaxation (6.5) to our 
matrix of coefficients, using w=1.0 (the ordinary Gauss-Seidel iteration), 
1.1, 1.2, 1.3, and 1.4. As first step, we must specify the initial values 
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Let us first consider ordinary Gauss-Seidel iteration (w=1.0) with the 
initial values all equal. Then i ,(0) 0',j 2 ) = =0.06667. Following the order 
indicated by the matrix, we first calculate i >(1) (4,2) according to 

6P (1 >(4, 2) = 6P<°>(4, 1) + 6P (0 >(3, 2). 

We next calculate P°\4, 1) according to 

1 1 i> (1 >(4, 1) = 2 P< »>(4, 2) + 6i> (0) (4, 0) + 5P (0 >(3, 1). 

We continue in this manner, calculating J» (l) (4,0), J P (1) (3,2),..., J P (1) (0,0). 
We may then normalize these values (divide each one by their sum) and 
start the second round of iteration, or we may start the next round of 
iteration immediately and normalize at completion of the iteration proce- 
dure. For this example, the normalized probabilities after k—0, 1, 2, 5, 11, 
12, and 13 stages of iteration (to= 1.0) are as shown in Table 4.2. 

Observe that the probabilities change only slightly from the twelfth 
iteration to the thirteenth. Somehow it must be decided when to stop 
iterating, that is, when the procedure has converged (or shown that 
convergence is not occurring). In the present example the convergence 
criterion was taken to be 

That is, we assume that the process has converged when the sum of the 
absolute changes of the probabilities from one iteration to the next is less 



Table 42 





k~0 


Jfe-1 


fc-2 


k=*5 


Jfc-11 


Jt-12 


fc-13 


i><*>(4,2) 


0.0667 


0.12500 


0.16304 


0.23077 


0.26404 


0.26444 


0.26466 


P<*>(4, 1) 


0.0667 


0.08523 


0.08893 


0.12338 


0.12464 


0.12455 


0.12449 


P< k \4,0) 


0.0667 


0.03977 


0.03866 


0.04774 


0.03978 


0.03960 


0.03949 


/»<*>(3,2) 


0.0667 


0.08523 


0.10316 


0.12751 


0.13995 


0.14019 


0.14033 


i»<*>(3, 1) 


0.0667 


0.08864 


0.10358 


0.12392 


0.12059 


0.12056 


0.12054 


V<*>(3,0) 


0.0667 


0.06225 


0.06761 


0.06623 


0.05431 


0.05410 


0.05399 


P w (2,2) 


0.0667 


0.06932 


0.07597 


0.05920 


0.06067 


0.06076 


0.06082 


/»<*>(2, 1) 


0.0667 


0.08662 


0.10412 


0.08036 


0.07458 


0.07455 


0.07454 


?^(2,0) 


0.0667 


0.07323 


0.08478 


0.05100 


0.04162 


0.04149 


0.04141 


P w (l,2) 


0.0667 


0.06402 


0.03595 


0.01954 


0.01876 


0.01878 


0.01880 


P<*>(1, 1) 


0.0667 


0.08543 


0.05956 


0.03260 


0.02925 


0.02923 


0.02922 


P ( *>(1,0) 


0.0667 


0.07764 


0.04448 


0.02312 


0.01901 


0.01896 


0.01893 


P<*>(0,2) 


0.0667 


0.01581 


0.00755 


0.00332 


0.00305 


0.00305 


0.00305 


P ik \0, 1) 


0.0667 


0.02552 


0.01301 


0.00639 


0.00564 


0.00563 


0.00563 


P<*>(0,0) 


0.0667 


0.01719 


0.00958 


0.00492 


0.00411 


0.00410 


0.00409 
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than 0.001. This criterion, which is completely arbitrary, is satisfied in our 
example at k = 13. 

If we had used an to different from the Gauss-Seidel value of unity, we 
might have speeded up the convergence; that is, we might have satisfied 
our arbitrary convergence criterion after less iterations. For this example 
use of an overrelaxation factor o>=1.3 will produce convergence (as 
defined above) for the same initial values after k = 9 iterations, a saving in 
computation time of about -^?»31%. 

Another way to speed convergence is to start out with a set of initial 
values closer to the solution. That is, use some intelligence in specifying the 
initial values, instead of choosing them in a completely arbitrary fashion. 
In the present example, we know that a product solution of the form 

will satisfy the equilibrium state equations everywhere except on the 
boundary. Therefore it seems reasonable that the true solution should not 
differ numerically as much as from the product solution values as from the 
arbitrary value . 
For the example under consideration, using the product 

where c is the normalization factor, leads to convergence (with respect to 
our criterion) in 10 iterations when w= 1.0, and in 6 iterations when w= 1.3 
[as opposed to 13 and 9 when the initial values are P^XjiJ^^js]- Speeds 
of convergence (numbers of iterations required) for this example for 
various values of the overrelaxation factor w and different initial values are 
summarized in Table 4.3. 

The reason for ordering the equations as we have done, starting with the 
boundary states, should now be apparent. If, when using the product 
( a i'//i0(^2 2 //2!) c as the initial vector, we had started the iteration proce- 
dure at interior states, then no change in the calculated values would have 
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CO 


1.0 


1.1 


1.2 


1.3 


1.4 






10 


8 


6 


6 


8 


i 

15 




13 


11 


9 


9 


11 
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occurred until a boundary equation (where the product solution does not 
hold) was reached. Thus the computation time of the first round of 
iteration would have been largely wasted. 

Another useful observation is that we know, in this example, that 
^•j l +j 2 =jPU\ji) = Pp where [Pj) is the Erlang loss distribution (3.3) of 
Chapter 3. In particular, we can calculate P(0,0)~P Q and P(4,2) = P 6 ; 
these can be used as starting values or, if they are not, can serve to indicate 
the degree of convergence achieved after a finite number of iterations. For 
example, i>(4, 2)= P 6 = B(6,6) = 0.26492; comparison with Table 4.2 shows 
that jP(4,2)-J >(13) (4,2)| = 0.00026. 

These tricks may seem hardly worth the effort for the simple example 
considered here. But for the solution of large sets of equations of the kind 
likely to arise in practical problems, the computer needs all the help it can 
get. 

Finally, it should be mentioned that when mathematical or numerical 
solution is impractical, simulation is often a useful tool for analysis. (This 
topic will be addressed in Chapter 6.) Simulation can be used to study 
processes that are much more complicated than those that can be studied 
by mathematical and numerical analysis. However, if an appropriate 
mathematical or numerical solution can be obtained, it is often more 
useful than simulation data. 

4«7. The Equivalent Random Method 

Consider the following important queueing system. Customers arrive in n 
independent Poisson streams, each stream directed at one of n different 
primary groups of servers. All customers who find all servers busy in their 
primary group are directed toward a single overflow group. The overflow 
group thus provides a common backup for the n primary groups. The 
service times on all groups are assumed to be independently and identi- 
cally distributed negative- exponential random variables. Customers who 
are blocked on the overflow group are cleared from the system. The 
operations researcher might be interested in determining the numbers of 
servers and the particular server arrangements that ensure that a prespeci- 
fied probability of loss (overflow from both primary and overflow servers) 
is not exceeded. 

We shall describe a technique, called the equivalent random method, for 
the approximate analysis of such a system. The theoretical basis of the 
equivalent random method has already been discussed in Section 4.3. We 
begin with a discussion of the simplest overflow system, which consists of a 
single primary group with one server and an overflow group with one 
server, as illustrated in Figure 4.6. 

Let P l be the proportion of time that the overflow server is busy. The 
load carried on the overflow server is the difference between the load 
aB{\,a) that overflows the primary server and the load aB(2,a) that 
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aB(2, a) 

! Overflow server 

4 

aB( 1 , a) 

! Primary server 

! 

a erl 

Figure 4.6 



overflows both the primary server and the overflow server. Since the load 
carried on a single server equals the proportion of time that server is busy, 
it follows that 

P x = a[B(ha)-B(2,a)], (7.1) 

where B(s, a) is the Erlang loss formula. 

Now consider the probability 71, that a customer who overflows the 
primary server finds the overflow server busy. Since the customers arrive at 
the two-server (primary and overflow) system in a Poisson stream, they see 
the same distribution of states on the two-server system as would an 
outside observer. It follows that JT, is the conditional probability that the 
overflow server is busy, given that the primary server is busy. Applying the 
definition of conditional probability, we have 



B(l,a) ' 



which can be written 



We can conclude from Equation (7.3) that the proportion of overflow 
customers who find the overflow server occupied, and thus are cleared 
from the system, is the ratio of the load overflowing the second server to 
the load overflowing the first server. Hence, viewing the overflow server as 
a system by itself, whose input stream is the overflow stream from the first 
server, the proportion of customers who find the overflow server busy is 
the ratio of the load not carried (that is, overflowing the second server) to 
the load offered (that is, overflowing the first server). 



The Equivalent Random Method 



167 



Comparison of Equations (7.1) and (7.2) shows that 



(7.4) 



from which we conclude that the overflow stream from the first server is 
not a Poisson stream. This should also be apparent from the fact that the 
probability that a customer will request service from the second server in 
any interval (/, t + h) is Xh + o(h) as A— >0 if the first server is busy at time /, 
and zero if the first server is idle at time /. If the overflow stream were 
Poisson, then this probability would be Xh + o{h) as h^>§, independent of 
any other considerations. 

It should now be clear that the following analogous properties are true 
for overflow systems composed of a single j-server primary group and a 
single c-server overflow group: 

1. The overflow stream is not Poisson. 

2. Of those customers who overflow the primary group, the proportion 
II who are also blocked on the overflow group is the ratio 



aB(s + c,a) 
aB(s,a) 



(7.5) 



The situation can be represented by Figure 4.7. Property 1 implies that 
overflow systems cannot, in general, be studied by the elementary methods 
developed thus far in this text (except through the use of multidimensional 
state equations, which is usually not practical). Property 2 is used in the 
equivalent random method, which was developed by Wilkinson [1956] (a 



Figure 4.7 



a' = aB(s + c, a) 



n s + c — 



aB(s + c,a) ^ 



a = aB(s, a) 
I 



n c = 



aB(s + c,a) 
aB{s, a) 



I 

a 
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similar method is given by Bretschneider [1956]) to facilitate the approxi- 
mate analysis of more complicated overflow systems, and to which we now 
direct our attention. 

Consider the system on the left-hand side of Figure 4.8. In this system, a 
group of c servers acts as the common overflow group for all traffic 
overflowing the n primary groups. The zth primary group is assumed to 
consist of s t servers, and to receive a direct Poisson load of a ( erl. The 
service times on all groups are assumed to be independently and identi- 
cally exponentially distributed. 

We shall analyze this system by means of an approximation technique 
widely known in teletraffic engineering as the equivalent random method. 

For the purposes of our approximate analysis, we represent the load that 
overflows the /th primary group by two parameters, the mean a t and 
variance v if which are calculated according to Equations (3.1) and (3.2): 

= a,.5(j ( , a,) (i = l,2,...,/»), (7.6) 

"H^ ^+i+VJ ( '- 1 ' 2 -- n) - (7J) 

That is, the load overflowing a primary group is characterized by the mean 
and variance of the equilibrium overflow distribution that would result if 
the overflowing customers were handled by an infinite-server overflow 
group. We denote by a' the mean value of the load that overflows the 
c-server overflow group. 

We wish to calculate the proportion of customers who overflow the 
c-server overflow group, given that they overflow a primary group. That is, 
we wish to calculate the probability that a customer who finds all s f servers 
busy in his primary group will also find all c servers busy in the overflow 

Figure 4.8 



t t 



{a, v) 
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group. Although in reality the value of this probability will, in general, be 
different for different values of index /, it seems reasonable that the value 
of the ratio 



should provide a meaningful approximation to this probability. It remains 
to calculate a'. 

To this end, we replace the n primary groups and their respective 
Poisson loads with a single "equivalent random" primary group and a 
single "equivalent random" load, as indicated on the right-hand side of 
Figure 4.8. The number s of servers in the equivalent random group and 
the size a (in erlangs) of the equivalent random load are chosen such that 
the mean a and variance v of the total overflow load offered to the 
overflow group remain constant. Thus, since the primary groups are 
independent of each other, we have 

« = <*,+ ••• +a„, (7.8) 
©«©,+ ••• +tv (7.9) 

Now the values of the equivalent random load a and the equivalent 
random group size s are chosen to satisfy the following equations for the 
(known) moments a and v: 

a = aB(s y a), (7.10) 

v~a(\-a+—r^ ). (7.11) 

Then it follows that 

a' = aB(s + c,a). (7.12) 

In summary, the equivalent random method is to replace the system on 
the left-hand side of Figure 4.8 with the system on the right-hand side, 
where the values a and s are chosen such that the resulting overflow 
moments a and v remain constant. Then, in this new "equivalent random" 
system, of those customers who overflow the j-server primary group, the 
proportion who also find all c servers busy on the overflow group is, 
according to Equation (7.5), 

«gC, + c,a) _«: ( 

c aB(s,a) a v ' 



We take the value JT C , calculated according to Equation (7.13), as the 
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probability of blocking on the overflow group suffered by overflow traffic 
from the primary groups. Likewise, the proportion JI of all customers who 
are blocked on both the primary and overflow groups is approximated by 

u- aB ( s+c ^ . (7.i4) 

a x + a 2 + ■ • • +a n 

It remains only to discuss how to solve Equations (7.10) and (7.11) 
numerically for the values a and s that correspond to a given pair a and t>. 
Curves that effect this solution are given in Wilkinson [1956]. A useful set 
of tables and curves, with accompanying text and examples, is given in 
Wilkinson [1970]. 

An easy and fairly accurate approximate solution to Equations (7.10) 
and (7.11), given by Rapp [1964], is (where z = v/a is the peakedness 
factor) 

a = v + 3z(z-\) (7.15) 
and, solving Equation (7.11) for s, 

s = — V 7 I- (7.16) 

a + z — 1 

(Note that z > 1; see Exercise 15.) These estimates of a and s are generally 
on the high side of the exact values. By rounding s down to its integral part 
[s], the corresponding value of a that produces the desired peakedness 
factor z is then found by solving Equation (7.16) for a: 

(\s]+a + l)(a + z-l) 
fl= !LJ !1 L m (7.17) 

a + z v ' 



Exercises 

15. Show that, if a > 0, then 



v — a when s = 0, 
v >a when s > 1, 



where a and v are defined by Equations (3.1) and (3.2). 



16. a x — 10 erl of Poisson traffic is offered to a group of 10 servers, and a 2 = 5 erl 
of Poisson traffic is offered to a group of 5 servers. An overflow group of c 
servers handles the overflow from the two primary groups on a BCC basis. 
Service times are independent, identically distributed, exponential variables. 

a. Find the value of c that gives a loss of 10% on the overflow group. 

b. Find the loss for the system as a whole. 

c. Find the loss on the overflow group and the loss for the system as a whole 
if flj and a 2 are each increased by 50%. 
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Suppose now that an overflow system is composed of a single primary 
group to which two independent Poisson loads a x and a 2 are offered, and 
an overflow group that receives only the overflow from a v To determine 
the proportion of overflow customers (from who are blocked on the 
overflow group, one would calculate the overflow mean and variance 
associated with only the load a v These values a, and v x , which can be 
calculated according to the formulas (1) and (2) given in Exercise 12, 
would then determine the equivalent random values a and s. 

Exercise 

17. Poisson traffic totaling 10 erl is offered to a group of 10 servers. This stream 
of Poisson traffic is evenly divided between two types of customers, called 
high- and low-priority customers. Both substreams of traffic are Poisson, and 
service times have a common exponential distribution. The high-priority 
customers who overflow the 10-server group are routed to an overflow group 
of c servers (where they are handled on a BCC basis). The low-priority 
customers who overflow the 10-server group are cleared from the system. 
Determine the smallest value of c such that the proportion of high-priority 
customers who do not receive service is no greater than 0.01. 

The equivalent random method has proved to be very useful in tele- 
traffic engineering, and doubtless will prove useful in other applications. 
For extensions and further discussion on the equivalent random method 
and its applications see, for example, Katz [1967] on the application of the 
equivalent random method to analysis of large communication networks 
with alternate routing, Neal [1971] on extending the equivalent random 
method to allow analysis of correlated streams of overflow traffic, Holtz- 
man [1973] on the accuracy of the equivalent random method, Fredericks 
[1980] on a related method for the analysis of systems with overflow traffic, 
and Kuczura [1973] on a method for approximating overflow streams. 



4.8. The Method of Phases 

The method of phases (or stages) is an approximation procedure according 
to which a random variable with an arbitrary distribution is replaced by 
either a sum, or a mixture, or a combined sum-mixture of independent (but 
not necessarily identical) random variables (each being a phase or stage of 
the lifetime of the original random variable). This technique allows trans- 
formation of the original model into a multidimensional birth-and-death 
model, thus making it amenable to the powerful tools of analysis that have 
been described in this chapter. 

Consider, for example, the s-server Erlang loss system; we assume that 
blocked customers are cleared, arrivals follow a Poisson process with rate 
A, and service times are independent, identical, random variables with 
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arbitrary distribution function H(t). In this example of the application of 
the method of phases, we assume that the service time X can be approxi- 
mated by a sum of n independent, but not necessarily identical, exponen- 
tial random variables, 

X = X,+ --- +X n ; (8.1) 

that is, we imagine that the service time X is composed of n independent 
phases of service, the ith phase being exponentially distributed with 
distribution function /;.(*)= P{X t < t) = 1 - e'*'. Then £(Ar) = 2?_, /a" 1 
and K(A") = 2" =l ju~ 2 . (In particular, when [i x = ■ ■ - = /a„, then X has the 
Erlangian distribution of order «, whose density function is pictured, for 
various values of n, in Figure 2.3.) Since it is true that (2/_ i ') 2 > 
2"_i(ju>- -1 ) 2 (see Exercise 19), it follows that any service time described by 
a random variable X, where E(X)> V V(X) , can be represented as a sum 
of independent, exponential phases, as in (8.1), with the given mean and 
variance. Furthermore, by judicious choice of the values « and ju,,. (i = 
1,2, ...,«), other moments might also be fitted to better approximate the 
given service-time distribution. Of course, the phases X v ...,X n do not 
necessarily correspond to any actual phases of service, but are only 
artifices introduced for the purpose of approximating the original process 
by a birth-and-death process. 

Now suppose the service time X has greater variabilit y than the ex- 
ponential distribution prescribes; that is, assume E(X) < V V(X) . In this 
case, we can model the random variable X as a parallel arrangement of 
exponential phases (Morse [1958]); that is, the realization of X is obtained 
by choosing, with probability p ( , the realization of the random variable X { . 
Thus, the distribution function of X is F x (t) = '2 a l ^ 1 p f F l (t) t where, as 
before, /;.(/) = l-r h '; then E(X) = l p^r 1 and V(X) - 
227 = ,^ j jh, _2 -(2"„ 1 ^ j ju~ 1 ) 2 . In this case, X is said to be a mixture of 
exponentials, and F x (t) is called the hyperexponential distribution f unction . 
We leave it to Exercise 19 for the reader to show that E(X)> V^(J) 
when X is modeled as a serial arrangement of exponential phases, and 
E(X) < V V(X) when X is modeled as a parallel arrangement of ex- 
ponential phases. 

Thus, as a practical matter, one may try to approx imate an arbitrary 
random variable J by a sum [when E(X)> ^/ V{X) ], a mixture [when 
E{X)<y V{X) ], or a combined sum-mixture (see, for example, Klein- 
rock [1975]). In fact, as we shall discuss shortly, such an approximation can 
be made with any desired degree of accuracy. 

To continue with our example of the method of phases applied to the 
5-server Erlang loss system, suppose that a representation of the form (8.1) 
has been fitted to the original data or hypothesized service-time distribu- 
tion. For ease of exposition let us assume n = 2. Now, if we let PU1J2) be 
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the equilibrium probability that simultaneously there are j\ customers in 
phase 1 of service and j 2 customers in phase 2, the corresponding conserva- 
tion-of-flow equations are 

/*, +j 2 ih)P{j l J 2 )=\P{j x ~ 1 Jl) 

+U\ + i)n l PU\+ij 2 - i)+(y 2 + 1 W0v 2 + 1) 

Ui+J 2 <s) (8.2) 

and 

0iMi+y2^WiJ 2 )=^0i- iJ 2 )+0'i+ i)i^C/i + hj 2 - 1) 

Ui+J 2 = s). (8.3) 
Equations (8.2) and (8.3), together with the normalization equation, 

^P{j x J 2 )=h (8.4) 

can now be solved, numerically or otherwise, for the probabilities P(j\J 2 ) 
(y, +j 2 <s), from which we can calculate the equilibrium probability Pj 
that there are j customers in service: 

Pj= 2 P(JvJ 2 ) = 2 P{Kj-k) (y = 0,l,. ..,5). (8.5) 

It is interesting to note that the method of phases was originally 
suggested by Erlang himself. A generalization of the method, using com- 
plex-valued probabilities, has been given by Cox [1955]. Gaver [1954] has 
proposed an extended method of phases, in which the number n of terms in 
the sum (8.1) is itself a random variable (that is, X is a compound random 
variable). Recently, Neuts [1975a] has greatly extended these ideas with his 
introduction of the notion of probability distributions of phase type, of which 
the Erlangian and hyperexponential distributions are very special cases. 
Neuts has applied his theory to develop computational algorithms for the 
solution of a wide variety of problems in queueing theory and related 
areas; see, for example, Neuts [1975b, 1977, 1978a, 1978b, 1980, 1981]. 

A particularly interesting aspect of the method of phases is that it can 
sometimes be used to obtain exact results by use of a limiting process. This 
idea has been developed by Schassberger [1973, pp. 32-33], who proves a 
theorem that states, roughly speaking, that any nonnegative random vari- 
able X can be represented as accurately as desired by a compound sum 
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(that is, the number of terms in the sum is itself a random variable) of 
independent, identical, exponential variables. 

We now illustrate this idea by using the ordinary method of phases to 
prove the important theorem that was asserted in Section 3.3, namely, that 
the Erlang loss distribution, given by (3.3) of Chapter 3 and repeated here 
for convenience, 

Pj r = 0,1,..., s), (8.6) 

^ (X//*)* 

is valid for any service-time distribution function with finite mean /i*" 1 . 

To see this, observe that Equations (8.2) and (8.3), which describe an 
Erlang loss system in which service times are composed of a sum of two 
exponential phases with means /if 1 and juf are satisfied by the product 
solution 

p( . (x/ Ml y (x/y^ ,, +/ ^ 

HJvJ2)= — n Ti — c \J\+J2< s h (8.7) 

[Compare (8.2) with the equations (2.13) that describe two queues in 
tandem.] Substitution of (8.7) into (8.5) gives, after application of the 
binomial theorem, 

(- + -y 

Pj= ,, c 0-0,1 s); (8.8) 



if we set 



1--L + -I 

M Mi M2 



in (8.8) and normalize, the result is (8.6), as promised. It is easy to see the 
assumption that the service time is composed of n = 2 phases is irrelevant; 
the same result (8.6) would be obtained for any number n of phases and 
corresponding mean values ju, -1 whose sum is ji -1 . 

Thus we have proved the theorem in the special case where the service 
time is modeled as a serial arrangement of exponential phases. We leave it 
to Exercise 18 for the reader to show that the Erlang loss distribution 
results when the service time is modeled as a parallel arrangement of 
exponential phases. These results strongly suggest that our result holds for 
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all service-time distributions. Schassberger's theorem shows that the argu- 
ment made here can be sharpened to provide a rigorous proof. For other 
proofs, see the references given in Section 3.3. 

For an example of the application of the method of phases to the study 
of networks of queues, see Kelly [1976] and Barbour [1976]. 



Exercises 

18. Consider the Erlang loss system with hyperexponential service times; that is, 
assume that each arrival is assigned a service time chosen randomly and 
independently from one of (for simplicity) two different exponential distribu- 
tions. (Thus, if the two distributions have means /if 1 and /i 2 ~ ! > and are chosen 
with probabilities p x and p 2 = a I ~P\, respectively, then the service-time density 
function is p x ^ e ~^ xt ^Pi^-ifi ^ ^d the mean service time is p\fi\' 1 + 
P2P2 1 ) Show that this system is described by the Erlang loss distribution 
(8.6). 

19. Show that if a random variable X is given by a sum of independent, 
exponential random variables, then E(X) > ^/V(X) ; whereas if A' is given by 
a mixture of independent, exponential random variables, then 
E(X)<Vv(X) . 
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5.1. Introduction 

In this chapter we investigate some important queueing models that cannot 
be studied in the framework of the birth-and-death process. 

Recall that a queue is characterized by the input process, the service 
mechanism, and the queue discipline. In the preceding chapters we con- 
centrated mainly on queues with Poisson or quasirandom input and 
exponential service times. These assumptions imply that the future evolu- 
tion of the system from some time t depends only on the state of the 
system at time t, and is independent of the history of the system prior to 
time /. In these models, the "state" of the system could always be specified 
in terms of the number of customers present. (In the multidimensional 
case, the state was specified in terms of the number of customers of each 
type present at time t.) 

Suppose that we are interested in a queue for which knowledge of the 
number of customers present at any time t is not sufficient information to 
permit complete analysis of the model. For example, consider the case in 
which the service times are assumed exponential, but the customer's arrival 
epochs are separated by a constant time interval. Then the future evolution 
of the system from some time t would depend not only on the number of 
customers present at time t, but also on the elapsed time since the last 
customer arrival epoch (because the arrival epoch of the next customer is 
strictly determined by the arrival epoch of the last customer). 

Clearly, a new method of analysis is required. A powerful method for 
the analysis of certain queueing models, such as the model in the above 
example, is that of the imbedded Markov chain, introduced by Kendall 
[1951, 1953]. As with the birth-and-death process, there is a vast theory of 
Markov chains. More generally, both are subsumed under the heading 
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Markov processes. As is our policy, we shall aim at as direct an approach 
to the analysis of our queueing models as possible, without extended 
excursions into the surrounding theoretical structure. Thus we shall in- 
troduce the main ideas behind the theory of the imbedded Markov chain, 
and show how these ideas facilitate the analysis of certain important 
queueing models. 

We will study two important basic models, and some of their variations, 
which are easily analyzed by the method of the imbedded Markov chain. 
These two models are, in the notation introduced by Kendall [1953], the 
M/G/l queue and the GI/M/s queue. (We shall explain Kendall's 
notation shortly.) 

The M/G/l queueing model assumes that customers arrive according 
to a Poisson process; the distribution of service times is arbitrary; there is 
one server; and all blocked customers wait until served. This model differs 
from the 1 -server Erlang delay model we studied earlier only in that the 
service times are no longer required to be exponentially distributed. This 
increase in generality requires a corresponding increase in the complexity 
of the mathematical apparatus needed for its analysis. 

The second basic model we shall study, the GI/M/s queue, assumes 
that the interarrival times are independent, identically distributed, non- 
negative random variables; the service times are exponentially distributed; 
there are s > 1 servers; and all blocked customers wait until served. This 
model differs from the 5-server Erlang delay model we studied earlier only 
in that the interarrival times are no longer required to be exponentially 
distributed. As with the M/G/l queue, the increased generality of the 
model requires mathematical tools beyond those required for birth-and- 
death models. In both cases, the main new mathematical tool we shall use 
is the imbedded Markov chain. 

Kendall's shorthand notation a/b/c is now widely used to describe 
queueing models. In this notation, a specifies the arrival process, b speci- 
fies the service time, and c is the number of servers. In every case, it is 
assumed that all blocked customers wait until served. Of course, this 
notation can't characterize every model, so extended versions have been 
introduced to permit specification of other system characteristics, such as 
the order of service or the number of waiting positions if there is a 
maximum allowed queue length. However, these extensions are not univer- 
sally accepted, and often seem to be clumsier than a simple longhand 
characterization of the queueing model. 

Some examples of Kendall's notation are 

M/G/\ — Poisson (Markov) input, General (arbitrary) service-time dis- 
tribution function, 1 server; 

GI/M/s — General, independently distributed interarrival times, ex- 
ponential A/arkov) service times, s servers; 
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M/D/s — Poisson input, constant (deterministic) service times, s 
servers; 

E k / M/s — A>phase £rlangian interarrival-time distribution function, ex- 
ponential service times, s servers; 

Mf M/s — Poisson input, exponential service times, s servers (the Erlang 
delay model). 

Of course, each of the above models is only a special case of the model 
GI/ G/s. But it is often true that the more general the analysis, the less 
specific (and useful) is the derived information. Such is the case here. The 
models that appear to be optimal with respect to the tradeoff between 
generality and specificity are the M/GJX and GI/ M/s models (and some 
of their variations), to which we devote this chapter. References for these 
models will be given in the text. For a discussion of the more general 
model GI/G/s, which we shall not discuss, see De Smit [1973] and the 
papers referenced therein. In preparation for the analyses of these models, 
we start with a discussion of some important related results. 

5.2. The Equation L = \W (Little's Theorem) 

Consider a system in equilibrium in which customers arrive, remain in the 
system for a length of time (called the waiting time), and then depart. Let A 
be the arrival rate, W be the mean waiting time, and L be the mean 
number of customers present. (We are adopting this notation to be 
consistent with common practice.) Then, if the mean values A, W, and L 
exist, they are related to each other according to the equation (which holds 
for each realization of the process) 

L = \W. (2.1) 

The well-known theorem embodied in Equation (2.1) (which is often 
referred to as Little* s theorem, or simply as "the equation L=\W") is one 
of the most general and useful results in queueing theory. It was first stated 
in a rigorous manner by Little [1961], Little's original statement of (2.1) 
required that the system under consideration possess certain conceptually 
difficult mathematical properties (e.g., metric transitivity), and his proof 
was similarly nonintuitive. Since the publication of Little's paper, many 
authors have attempted to simplify the hypotheses under which (2.1) holds, 
or present proofs that are intuitively more appealing, or give heuristic 
arguments that convincingly explain (2.1) even though they lack rigor. 
Recently, Stidham [1974] published a relatively simple yet rigorous proof 
of (2.1), which is exceedingly general in that it requires only the existence 
and finiteness of the mean values A and W. 

In this section we will give some examples to illustrate the range and 
utility of (2.1), followed by a simple heuristic argument to "explain" (2.1) 
and then an outline of Stidham's proof. 
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Consider, for example, the relationship between offered load and carried 
load that has been used repeatedly throughout this text on heuristic 
grounds, namely, carried load equals the portion of the offered load that is 
carried. Offered load a, by definition, is the product of the arrival rate and 
the mean service time; hence, in the language of (2.1), the offered load 
equals \W, Carried load a', by definition, is the mean number of busy 
servers; hence, in the language of (2.1), the carried load equals L. As an 
application of this idea, consider the equilibrium j-server Erlang loss 
system, where \ is the parameter of the Poisson arrival stream [not the 
same quantity as defined in (2.1)], and t is the mean service time. We 
define the "system" as the group of s servers. The left-hand side of (2.1) is, 
by definition, the carried load a! . To calculate the right-hand side of (2. 1), 
note that the rate at which customers enter this system is A(l — B(s,a)), and 
the mean time spent in the system by those who enter is the mean service 
time t. Therefore, (2.1) becomes 

a'=[\(\-B(s,a))]-T, (2.2) 

where, by definition, a = Xt. Hence, (2.2) becomes 

a' = a(\-B{s,a)), (2.3) 

which is the same as Equation (3.5) of Chapter 3. 

For another example, consider again Exercise 28 of Chapter 3. In part a 
of that exercise, the reader is asked to verify (2.1) by direct calculation of L 
and W, where L is the mean number waiting in the queue and W is the 
mean waiting time for service. That is, in part a the "system" is defined to 
include only the queue, not the servers. In part b of the same exercise, the 
reader is asked to again verify (2.1), this time defining the system to 
include both the queue and the servers. In either case, the calculation is 
routine. 

Another example is given by Exercise 37 of Chapter 3, where the reader 
is asked to verify Equation (8.18) of Chapter 3, which can be written 

£jPj[n]-<m[p- l + E(W)]. (2.4) 

The left-hand side of (2.4) equals the mean number of customers present 
(waiting in the queue and in service) in an «-source equilibrium queueing 
system with quasirandom input, exponential service times (with mean fx ~ '), 
and BCD queue discipline. Since a is the offered load, therefore api = X; 
furthermore, the term in brackets on the right-hand side of (2.4) equals the 
mean time a customer spends in service or waiting in the queue. Hence, 
Equation (2.4) is simply a special case of (2.1) and needs no further 
verification. 
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As a final example, consider Equation (8.20) of Chapter 3, 

n = \(y- l + E(T)), 

which was obtained, after much calculation, for the model in which n 
sources submit quasirandom input to s exponential servers, with the queue 
discipline blocked customers delayed. We assert that this equation holds 
not only for that model, but also for the more general model in which each 
of n sources submits requests at rate y when idle, and experiences a 
sojourn time T before returning to the idle state, regardless of the 
complexity of the network through which the sources travel, and regardless 
of the distributions that describe the service times and interarrival times. 
To prove this assertion, simply observe that we can identify n as L and 
{y- x + E{T))*& W. 

Of course, Equation (2.1) is most useful when one wants to know W and 
it is difficult to calculate it directly, but L is easy, or vice versa. This is 
often the case, because L is a time-average of a discrete-valued random 
variable, whereas W is a customer-average of a (in general) continuous-val- 
ued random variable. 

For example, we can use (2.1) to prove easily an important assertion 
made previously in this text: In any queueing system with a nonbiased queue 
discipline, the mean wait for service suffered by an arbitrary customer is 
independent of the order of service. (Recall that a nonbiased queue disci- 
pline is one in which customers are selected for service from among those 
waiting without regard to the service times of the waiting customers; that 
is, the next customer chosen for service has the same service-time distribu- 
tion function as does an arbitrary customer.) 

To prove this assertion, note that according to (2.1) the mean wait W is 
proportional to the mean queue length L. But the state equations from 
which one calculates the distribution of the number of customers in the 
queue do not depend on the identity of the particular customer chosen for 
service at any time; that is, the distribution of the number of customers 
waiting in the queue does not depend on the order of service if the queue 
discipline is nonbiased. Therefore, the mean queue length L is independent 
of the order of service, and hence, by Equation (2.1), so is the mean 
waiting time W. 

Note that, in general, the distribution function of waiting times does 
depend on the order of service for nonbiased queue discipline, even though 
its first moment W does not. 

The following heuristic argument in support of (2.1) was suggested by P. 
J. Burke. Assume that the mean values L and W exist, and consider a long 
time interval (0,0 throughout which statistical equilibrium prevails. The 
mean number of customers who enter the system during this interval is Xt. 
Imagine that a waiting time is associated with each arriving customer; that 
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is, each arrival brings a waiting time with him. If the average waiting time 
brought in by an arrival is W, then the average amount of waiting time 
brought into the system during (0,f) is XtW. On the other hand, each 
customer present in the system uses up his waiting time linearly with time. 
If L is the average number of customers present throughout (0,0, then Lt 
is the average amount of time used up in (0,/). Now, as ?-*oo the 
accumulation of waiting time must equal the amount of waiting time used 
up; that is, the ratio of the waiting time accumulated in (0,/) to that used 
up in (0,/) must approach unity as t^oo. Hence, 

lim -^5- = 1 (roughly speaking), 

from which (2.1) follows. 

We conclude this section with Stidham's [1974] statement of the theorem 
(2.1) and an outline of his proof. For further discussion see Brumelle 
11971], Heyman and Stidham [1980], and Miyazawa [1977]. 

Theorem. Let L(x) be the number of customers present at time x, and 
define the mean number L of customers present throughout the time 
interval [0, co ) as 

L= Urn - CL{x)dx\ (2.5) 

/— >oo t Jq 

let N(t) be the number of customers who arrive in [0,t], and define the 
arrival rate X as 

\=\i m M!l ; (2.6) 

/— »oo t 

and let be the waiting time of the /'th customer, and define the mean 
waiting time W as 

1 " 

W= lim - 2 W r (2.1) 



n-*oo n 



i-1 



// X and W exist and are finite, then so does L, and they are related 
according to (2.1). 

proof. Our strategy is first to show that for all t > 0, 

1/(0 > ( l L(x)dx>V(t), (2.8) 



where U(t) is defined as the sum of the waiting times of the customers that 
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have arrived in [0, t], and V(i) is defined as the sum of the waiting times of 
the customers that have (arrived and) departed in [0, t\. We then rewrite the 
inequality (2.8) as 

if >i/; w >ii£) ( 2.9) 



and establish that 



and 



lim = \W (2.10) 



Hm BH= lim Jlf). (2.U) 



Our theorem then follows from (2.9)-(2.11) and the definition (2.5). 

To prove (2.8), let 7), i — 1,2,..., be the arrival time of the /th customer 
(0< T x < r 2 < • • • ), and write 

U(t)= 2 W t , (2.12) 
■Si(0 

where S x (t) is the set of values of the index / for which I) < /, 



and similarly, we write 



S i (t) = {i:T i <t}; 
V{t)= 2 w it 



S 2 (t) = {i:T,+ W t <t). 



(2.13) 
(2.14) 

(2.15) 



where S 2 (t) is the set of values of the index i for which T t + W t < t> 



The truth of the inequality (2.8) is now easy to see if we use the following 
economic interpretation. Suppose that the system incurs a unit cost for 
each unit of time spent in the system by each customer. Then U{t) is the 
total cost incurred in [0, /] if each customer's cost is charged in a lump sum 
at the instant of his arrival; j l L(x)dx is the total cost incurred in 10,/] if 
each customer's cost is charged (in the usual way) continuously at unit rate 
while he is in the system; and V(t) is the total cost incurred in [0,/] if each 
customer's cost is charged in a lump sum at the instant of his departure. 
With this interpretation, (2.8) becomes obvious. (Note that this argument is 
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essentially the same as the heuristic argument given earlier. Indeed, this 
proof of L=\W is, in fact, a rigorous treatment of the earlier heuristic 
argument.) 

Our next step is to establish (2.10). To this end, we calculate the product 
XW using the definitions (2.6) and (2.7): 

A H^^)(^it,4 (2I6) 

Now replace the second limit on the right-hand side of (2.16) by the 
equivalent representation 

lim - 2 W-= lim — 2 w„ (2.17) 

n^oo ri ,„i t^co N(t) (=1 

and use the fact that (lima) (lim b) = lim ab; then (2.16) becomes 

\W= lim - 2 W t . (2.18) 

/^oo t , = 1 

Observe that the sum on the right-hand side of (2.18) is the same as that in 
(2.12). Hence, we have proved (2.10). 
It remains only to prove (2.1 1). To this end, observe that 

lim-^=0. (2.19) 

Equation (2.19) becomes obvious when one realizes that the waiting times 
W if i= 1,2, do not systematically increase with increasing /, whereas the 

arrival times T if / = 1,2, , do become arbitrarily large as /'-»oo. (For a 

proof of (2.19), see Lemma 4 of Stidham [1972].) It follows from (2.19) that 
for any e>0, there exists a number k such that WJT^e for all i>k; that 
is, 

W^<7> for all i>k. (2.20) 

Now suppose / is large enough so that N(t)>k. Observe that the index set 
S 2 (t), defined by (2.15), can be partitioned into the union of the two sets 
S 3 (0 and S 4 (/), defined by 

S 3 (i)={i'-T i + W ( <t and/<fc} (2.21) 

and 

S 4 (t) = {i:T i + W;<t and/>£). (2.22) 
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(That is, S 3 (t) and S 4 (t) partition the set S 2 (t) of customers who arrive and 
depart in [0,t] into those who arrive in [0,T k ] and those who arrive in 
(T k ,t\.) Hence, we can write (2.14) as 

V{t)= 2 Wi+ 2 W t . (2.23) 

S 3 (t) S 4 (t) 

Now consider the index set S 5 (t), defined by 

S 5 (t) = { i : 7} + 7}e < t and / >k] (2.24) 

Since 7}e>Wj by (2.20), it follows that S 4 (t) contains at least as many 
points as S 5 (t); that is, S 5 (t) c 5 4 (r), and therefore 

2 W t < 2 Wt- (2-25) 
It follows from (2.23) and (2.25) that 

v{t)> 2 Wi+ 2 ^- ( 2 - 26 ) 

By adding and subtracting the quantity 2 S we can write (2.26) as 

P(0> 2 2^+2^+2 W it (2.27) 

S 3 (/) 5 6 (/) S 6 (/) S 5 (t) 

where 

S 6 (0 = {* : 7; + </ and / <*}. (2.28) 

The last two sums on the right-hand side of (2.27) can be combined, so 
that (2.27) can be written 

V{t)> 2^-2^+2^ (2.29) 

s 3 (o s 6 (t) Sjit) 

where S 7 (t) is the union of S 6 (t) and S 5 (t): 

S 7 (t)={i:T i +T i e<t} = [i:T i <~]. (2.30) 
Comparison of (2.13) and (2.30) shows that 



«'>- 5 '(7b)' 



(2.31) 
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and it then follows from (2.12) and (2.31) that (2.29) can be written 

v{t)> 2 w- 2 ^ + f(-rb); (2.32) 

that is, 

m > m > \[ 2 Wi _ 2 w \ + Mm±M, (2 . 33) 

where the first inequality in (2.33) follows from (2.9). 

Now let t^<x> in (2.33). The sums in parentheses in (2.33) are bounded 
(by 2 /<t H^), and therefore 

lim -( 2 W- 2 w\=0. (2.34) 

Furthermore, 

llm tMl±i» = lim 7 ^l 7 - T J- Urn m. (2 . 35) 

Hence, if we take limits in (2.33) we obtain 

lim m >lim m > > lim jsa (2 36) 

/— >oo / /->oo / 1+C t-+oo t 

The statement (2.36) is valid for any e>0, no matter how small; hence 
(2.1 1) follows from (2.36), and the proof is complete. □ 



53. Equality of State Distributions at Arrival and Departure Epochs 

Often, queueing models are analyzed by writing equations that relate state 
probabilities to one another at certain well-defined points in time (epochs). 
The choice of the particular set of points with respect to which the 
equations are written is governed by the mathematical properties of the 
model under consideration, not by unfettered choice. For example, the 
M/G/l model is easily analyzed by relating the system states to each 
other at the instants at which customers finish service and leave the system. 
Therefore, we define the state of the system to be the number of customers 
left behind by a departing customer, and the corresponding departing 
customer's distribution {II*} describes the system at this special set of 
points. However, the quantities of interest are the state probabilities at a 
different set of points, namely the arrival epochs. That is, from the point of 
view of an arriving customer, the number of customers that he finds in the 
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system, not the number he leaves behind, is the quantity of interest. 
Fortunately, for a wide range of processes that includes the important 
ones, the state distributions at these two sets of points are identical. 

We have already encountered some similar situations. In the M/M/s 
(Erlang delay) model, the birth-and-death formulation led directly to the 
outside observer's distribution {Pj(t)}. But it was shown in Chapter 3 that 
the arriving customer's distribution {12,(0} is identical to the outside 
observer's distribution {Pj(t)} whenever the input is Poisson. Thus the 
more relevant arriving customer's distribution had been found indirectly. 
Likewise, the birth-and-death formulation for the case of quasirandom 
input in Chapter 3 yielded the outside observer's distribution, which, 
though not directly equal to, was easily translated into the arriving 
customer's distribution. 

We have the following important theorem relating the equilibrium 
distribution {JJj} of the number of customers found by an arrival (the 
arriving customer's distribution) and the equilibrium distribution {II*} of 
the number of customers left behind by a departure (the departing 
customer's distribution): 

in any queueing system for which the realizations of the state process 
are step functions with only unit jumps (positive and negative), the 
equilibrium state distribution just prior to arrival epochs is the same as that 
just following departure epochs: {IIj} = {II?}. (And, in addition, when the 
input is Poisson, [Fj] = {IZ,} = {II/}.) 

Note that in this theorem, as in the theorem L =>\W, the term "queueing 
system" is not used in a restricted sense. For example, the "system" may 
be defined as the servers and the queue, or it may be defined as the queue 
alone. The essential characteristic is simply that the number of customers 
present can increase and decrease by at most one customer at a time. Thus 
the theorem holds for all systems (in statistical equilibrium) considered so 
far in this text. 

The theorem becomes intuitively obvious when one observes that during 
any time interval the number of upward transitions Ej-*E j+l cannot differ 
by more than one from the number of downward transitions E J+l ^>Ej. 
That is, for every customer whose arrival causes the transition Ej-*E J+l , 
there corresponds another customer whose departure causes the transition 
E J+l ^>Ej. Thus, the proportion of customers for whom the system is in Ej 
just prior to arrival approaches the same limit as the proportion of 
customers for whom the system is in Ej just after departure. 

Because of this theorem's importance (which will become obvious) and 
because (strangely) it does not appear to be well known even among 
experts, we now give a formal statement and proof. The theorem and proof 
are taken from an adaptation by P. J. Burke [1968, unpublished] of a 
corresponding result by L. Takacs for the M/G/\ queue. 
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Theorem. Let N(t) be a stochastic process whose sample functions are 
{almost all) step functions with unit jumps. Let the points of increase after 
some time t = be labeled T a , and let the points of decrease be labeled T' n , 

a = 1,2, Let N(T a — ) be denoted A a , and let N(T^+) be denoted D a . 

( Thus, if upward jumps correspond to arrivals and downward jumps corre- 
spond to departures, then A a is the state of the system just prior to the ath 
arrival epoch and D a is the state of the system just after the ath departure 
epoch.) Then if either lim n _ >00 i > {>4 H <k] or lim,,^^ P{D n < k) exists, so 
does the other and these are equal. [Thus N(t) has the same limiting 
distribution just prior to its points of increase as it does just after its points 
of decrease, if this limiting distribution exists.] 

proof. Let JV(0) = /. We first show that D n+i <k A n+k+l <k. Suppose 
D n+i =j<k. Then there are n+j of the T a preceding T' n+i . Thus A n+J+l < 
j, and, since k—j arrivals later than T H+ j+ l the state can be at most k— j 
greater, therefore A n+k+l <k. 

We will now show the converse, namely A n+k+l <k => D n+i <k. 
Suppose A n+k+l =j *kk. Then there are n + i + k—j of the T' a preceding 
T n+k+x \ that is, r„' +| . +w is the last T a preceding T n+k+1 . Hence D n+i+k _j 
<j and D tt+i <k. 

Therefore, P{T> n+i <k}~ P{A n+k+1 <£}, and both sides must have the 
same limit. Thus, for any k, 

Jim P { D n +t < k) - Urn P {A n + k+ , < *} ; 

that is, 

\imP{D n <k}^ \\m P{A n <k), 

and the proof is complete. □ 
Thus we have shown that, with the definitions 

LTj= \imP{N(T n -)=j} 0=0,1,2,...) (3.1) 

and 

n; = Jim P { N( T' n + ) =j } (j = 0, 1, 2, . . . ), (3.2) 

then 

Uj = nf = 0,1,2,...); (3.3) 

that is, when the state changes occur one at a time and statistical 
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equilibrium prevails, then the arriving customer's distribution { Tlj) and the 
departing customer's distribution {II*} are equal. 

Exercises 

1. (P. J. Burke [1968, unpublished].) Consider a birth-and-death process with 
birth rate A, and death rate when the system is in state E y (J — 0,\,...; /io = 0). 
Let {Ilj} be the state distribution just prior to arrival epochs (births). 

a. Prove that 

and compare with Equation (3.15) of Chapter 2. [Hint: Show that 

n;=njP{E J+ +n? +x p{ e j+ 

where {II*} is the state distribution just subsequent to departure epochs 
(deaths) and P{E J+l -*Ej} is the conditional probability that the next 
transition will take the system into state E p given that it is currently in state 

b. Use the result ( 1) of this exercise to prove that in any system whose state process consti- 
tutes a (one-dimensional) birth-and-death process with birth coefficients depending only 
on the number of idle sources (i.e., Xy[n] = ftn-j)), the arriving customer's 
distribution is the same as the outside observer's distribution for a similar system 
with one less source. [This proves, in particular, the result (6.8) of Chapter 3 for 
birth-and-death systems with quasirandom input.] 

2. Burke's theorem. For the M/M/s queue in equilibrium, the sequence of 
service completion epochs follows a Poisson process (with the same parameter 
as the input process); that is, the output process is statistically the same as the 
input process. 

proof. Let r, be an arbitrary service completion epoch in equilibrium, and 
let T 2 be the next consecutive service completion epoch; and define Fj{t) to be 
the joint probability that the number of customers in the system just after time 
T, + / equals j and T 2 >T X + t. 

a. Obtain a set of differential-difference equations for /)•(/)» an d show that the 
unique solution satisfying the initial condition /}(0) = II* is 

Fj(t)-I$e-». (1) 

b. Show that Equation (1) implies that the time separating two successive 
departures is exponentially distributed with the same mean as the inter- 
arrival times. 

c. To complete the proof of the theorem, show that the interdeparture intervals 
are mutually independent. [Hint: Because of the Markov property of the 
M/M/s queue, it is sufficient to show that the length of an interdeparture 
interval and the number of customers in the system at the start of the next 
interval are independent. To do this use the fact that the joint probability 
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that the interdeparture interval length is in (t,t + dt) and the number of 
customers in the system at the start of the next interdeparture interval isy is 
given by /> + i(/)mO"+ 0*. where m0" + = 0'+ »V+ 1 < s and 1) = 
sp. if j+ 1 >s y and is the average service time.] 

This theorem was proved independently by Burke [1956] (whose proof is 
outlined here) and by Reich [1957]. For a related discussion and references, the 
reader is referred back to the subsection "Queues in Tandem" in Section 4.2. 
Question: Can the assumption about the form of u.(/') be relaxed? 

5.4. Mean Queue Length and Mean Waiting Time in the 
M/G/l Queue 

In this section we will apply the results of the last two sections to obtain 
the following important results for the equilibrium M/G/l queue: The 
expected value of the number Q of waiting customers found by an 
arbitrary arrival and the expected value of the time W spent by an 
arbitrary customer waiting in the queue for service to begin are given by 



where t is the mean and a 2 is the variance of service times, and p = Xt< 1 
(where A is the arrival rate). 

Note first that (4.2), which is known as the Pollaczek-Khintchine formula, 
follows immediately from (4.1) and the equation L = \W. [Let "L" corre- 
spond to the mean number of customers waiting in the queue, from the 
viewpoint of the outside observer; and let " W" be the mean waiting time. 
Because the outside observer's viewpoint and the arriving customer's 
viewpoint are identical in systems with Poisson input, it follows that 
L=E(Q). Hence, E(Q) = XE( W).] Before we establish (4.1), however, let 
us consider some ramifications of these results. 

Suppose first that service Jtimes are exponentially distributed. Then 
a 2 = T 7 and, if we denote by W M the mean waiting time in the M/M/\ 
queue, it follows from (4.2) that 




(4.1) 



and 




(4.2) 



P 



(4.3) 



1-P 



T. 



Now suppose that service times are constant. Then o 2 =0, and if we 
similarly denote by W D the mean waiting time in the M/D/l queue, it 
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foUows from (4.2) that 



That is, 

W M =2W D (4.5) 

(and, similarly, the mean queue length in M/M/\ is twice that in 
M/D/l). This vividly illustrates the effect of the form of the service- time 
distribution function on the queue length and waiting times in the M/G/\ 
queue. [It is remarkable to observe, however, that although the time spent 
waiting in the queue does depend on the service-time distribution function, 
the probability that a customer will in fact wait in the queue does not; this 
probability equals p for any equilibrium M/G/l queue. To see this, recall 
Equation (4.11) of Chapter 3: \ — P Q = At. Now, P — II q because we have 
Poisson input. Hence, I — JT = p, as asserted.] 

The importance of the variance term in the formulas (4.1) and (4.2) is 
underscored by the following observation: An M/G/l queue with finite 
mean service time, no matter how small, will have infinite mean queue 
length and infinite mean waiting time if the variance of the service-time 
distribution function is not finite. 

We now give the promised derivation of (4.1), using a clever, though 
nonintuitive, argument due to Kendall [1951]. The trick is to write the 
equations from the viewpoint of the departing customer, and then interpret 
the results to describe the arriving customer's viewpoint. 

Let N£ be the number of customers in the system (including any 
customer in service) just after the service completion epoch of the Arth 
departing customer, and let X k be the number of customers that arrive 
during the service time of this customer. Clearly the random variables X u 
A^j.-.are mutually independent and identically distributed, and X k is 
independent of iVf , N%, . . . , 7V£_ , . 

If the kth. departing customer does not leave the system empty, then the 
(k + l)th departure will leave behind those same customers in the queue 
left by the Arth departure except for himself, plus all those customers who 
arrived during the service time of the (k + l)th customer. Thus 

N£ + i-Nt-l + X k+l (JV*>0). (4.6) 

On the other hand, if the kth customer leaves the system empty, then the 
(Ar-rT)th customer will leave behind just those customers who arrived 
during his service time. Thus 



2(1 -p) 



(4.4) 



(4.7) 
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Equations (4.6) and (4.7) can be combined into a single equation 

^r^-W + ^ + .» (4-8) 
where the random variable 8(N*) is defined as 

«*-(: " T>°°. 

We now square both sides of (4.8): 

NZl i - + * \K) + X} + x + 2N*X k+l 

-2X k+l 8(Ng)-2Ng8(Ng). (4.10) 

It follows from the definition (4.9) that 

8XNZ) = 8(NZ) (4.11) 

and 

NZ8(NZ) = NZ, (4.12) 
so that Equation (4.10) can be written 

N& . - + W) + XL 1 + 2N£X k + 1 

We now take expected values through (4.13), Since N£ and X k+l are 
independent, we have 

E(Nf + ,) = E(Nf) + E{S(N*)) + E(X} + x ) 

+ 2E(N k *)E(X k+l ) - 2E(X k+i )E(8(N*)) 
-2E(N£). (4.14) 

Now let k-*<x> in (4.14). Assuming that a statistical-equilibrium distribu- 
tion exists, then in equilibrium the distribution of the number of customers 
left behind by each customer is the same. Therefore lim k _ t , <X) E(N^ l ) = 
Km^^ E(N f) = E(N* 2 ) and (4.14) becomes 

E(8(N*))[ I — 2E(X)] + E(X 2 ) 
E{ ' 2[\-E(X)] • (4 • 15, 

Similarly, taking expectations and limits through (4.8), we have 

E(8(N*)) = E(X). (4.16) 
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It remains to calculate E(X) and E(X 2 ). E(X) is simply the mean number 
of arrivals occurring during an arbitrary service time, that is, the offered 
load: 

E(X)=Xr = P . (4.17) 
Direct calculation of E(X 2 ) gives (see Exercise 21a of Chapter 2) 

E(X 2 )=\ 2 (o 2 + T 2 ) + p. (4.18) 
Substitution of Equations (4.16), (4.17), and (4.18) into (4.15) yields 

P 2 ri+(<7 2 /T 2 )i 

Q is the number of customers waiting in the queue just prior to an arrival 
epoch, and N* is the number of customers in the system just after a 
departure epoch. Since the arrival and departure distributions are equal, 
we can equate the mean number E(N*) of customers left by an arbitrary 
departure to the mean number of customers found by an arbitrary arrival. 
The latter quantity is the sum of the mean number p in service and the 
mean number E(Q) waiting for service. Thus 

E(N*) = p + E(Q). (4.20) 

Finally, comparison of (4.19) and (4.20) shows that (4.19) is equivalent to 
(4.1). 



5.5. Riemann-Stieltjes Integrals 

Before we can proceed with our study of queueing theory, we need to 
introduce (or review) a new tool, the Riemann-Stieltjes integral, along with 
the Laplace-Stieltjes transform, which will allow us to discuss discrete and 
continuous random variables within a single mathematical formalism. We 
shall continue to adhere to our policy of reviewing only the minimal 
theoretical background necessary to understand the applications in queue- 
ing theory. The material of this section is taken largely from Chapter 5 of 
Widder [1961]; see also Widder [1941]. 

Consider an interval (a, b) and a set of points x ,.x „...,*„ (a = x <x x < 
x 2 < • • • <x n — b) that partition (a, b) into a subdivision A of n adjoining 
subintervals (*o>*i)>(*i>-*2)> ••*>(■*«- !>■*»)• Th e norm ||A|| of a subdivision A 
is 



([A(| - max(jc l - x Q , x 2 - x v . . . , x n - x n _ t ) ; 
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that is, it is the length of the largest of the subintervals. We are ready to 
define the Riemann-Stieltjes (or, simply, Stieltjes) integral of a function 
g(x) with respect to a function h(x) from a to b: 

( b g(x)dh{x)= lim 2 (5.1) 

J a t|A||-»O fc _i 

where x k _ x <4 (k= 1,2,. ..,n). 

The left-hand side of (5.1) is the notation employed for the Riemann- 
Stieltjes integral. When h(x)= x it reduces to the usual notation of the 
ordinary (Riemann) integral; and, of course, the right-hand side of (5.1) 
reduces to the definition of the Riemann integral. The Riemann-Stieltjes 
integral possesses many properties analogous to those of the Riemann 
integral, including the fact that there is a method of calculation that is 
much easier than the definition. 

A sufficient condition for the existence of the limit on the right-hand 
side c- (5.1) is that g(x) be continuous on [a,b] and h(x) be nondecreasing 
on [a, b]. This will be the case for all applications in this book. 

Essentially all of the elementary properties of the Riemann integral are 
retained by the Riemann-Stieltjes integral. In particular, 

f b dh(x) = h(b)-h(a); (5.2) 

also, the Riemann-Stieltjes integral is linear, that is, 

( b [cigi(x) + c 2 g 2 (x)]dh(x) = c l f b gl (x)dh(x) + c 2 f b g 2 (x)dh(x), 

J a J a 

(5.3) 

where c x and c 2 are arbitrary constants; and for a <c <b, 

f b g(x)dh(x)= f C g(x)dh(x)+ ( b g(x)dh(x). (5.4) 

J a J a J c 

(A table of the elementary properties of the Riemann-Stieltjes integral is 
given on p. 155 of Widder [1961].) 

Calculation of the Riemann-Stieltjes integral is facilitated by the follow- 
ing two results: 

1. Suppose that g(x), h(x), and h'(x) = dh(x)/dx are all continuous on 
[a, b]. Then 

f b g(x)dh{x)= f b g(x)h'(x)dx. (5.5) 
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[Note that the integral on the right-hand side of (5.5) is an ordinary 
Riemann integral. This theorem has, in fact, been assumed from the 
beginning of this text.] 
2. Suppose h(x) is a step function with jumps at the points c k of 
amounts Ah k = h(c k + ) — h(c k — ) (&= 1,2, .. .,«), where a<c 1 <c 2 
< ... <c n <b; and suppose g(x) is continuous on [a, b]. Then 

f b g(x)dh(x)= 2 g(c k )M k . (5.6) 

J a k**\ 

For example, consider again Exercise 27 of Chapter 3. There the reader 
is asked to show that 

P{T>t} = P{W>t\W>0} (5.7) 

for the M/ M/\ queue with service in order of arrival, where T is the 
sojourn time and W is the waiting time. It follows from Equation (4.24) of 
Chapter 3 that 

P{W>t\W>0} = e- (l - a)>tt . 
Therefore, the problem can be restated as follows: Show that 

P{T>t} = e-<- l - a ^ (*>0), (5.8) 
where T= W+ X (sojourn time equals waiting time plus service time) and 

« <0) ' (5.9) 
>0), 



[ (/ 
P{X<t} = H(t)={ , ) 



P{W<t} = W(t)^[ ° w [ t<0 } (5.10) 

Note that the waiting-time distribution function (5.10), which follows from 
(4.25) of Chapter 3, has a jump of amount 1 — a at the origin. 
From probabilistic considerations, we can write 

P{T>t}= f°° \\-H(t-x)]dW(x) (5.11) 

J CO 

or, equivalently, 

[\-W(t-x)]dH(x). (5.12) 
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A reader with no knowledge of Riemann-Stieltjes integration could have 
solved the problem by rewriting (5.12) as 

P { T > t } = j ' [ 1 - W{t - x) ] H '{ x) dx + j °° H '{x) dx 
J o 

from which (5.8) easily follows. However, in order to use (5.11), one must 
be cognizant of the theory of Riemann-Stieltjes integration to account for 
the effect of the jump in the function W(x) at the origin. As an example, 
we shall evaluate the right-hand side of (5.11). 

First, we break the integral on the right-hand side of (5.11) into three 
integrals, ranging over (— oo,0— e), [0 — e,t], and (/,oo), where e is any 
positive number. Observe that 

f°~ \ 1 - H{t - x) ] dW{x) - (5. 13) 

because, according to (5.10), W(x) = for -oo<x<0, and therefore 
dW(x)= W'(x)dx = 0. Now observe that 

f°°[l-H(t-x)]dW(x)= f°° dW(x) = l- W(t) = ae- (1 - a)lxt , (5.14) 

where the first equality in (5.14) follows from the fact (5.9) that H(y) = 
when y < 0; the second equality follows from (5.2); and the third follows 
from (5.10). Hence, we have reduced (5.11) to 

P{T>t}=C e-^'-^dW^ + ae-V-W. (5.15) 

J 0-e 

To evaluate the integral in (5.15) we observe that W(x) has a jump of 
amount I- a at x = 0, and dW{x)= W'(x)dx = a(\ -a^-^'^dx for 
x > 0. Therefore 

r e-l'-^dWW-e-rt'-^X-a)* f 'e""*' -*><*( 1 - a)^-"^ dx. 

J 0-e J 

(5.16) 

It is now a simple matter to evaluate the ordinary Riemann integral on the 
right-hand side of (5.16), and to combine with (5.15) and simplify to get 
(5.8). 

It is perhaps worthwhile at this point to indicate a different proof of 
(5.7) that requires no integration. Let N be the number of customers found 
by an arbitrary arrival in an equilibrium M/M/l queue with offered load 
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a. Then, from Equations (4.3)-(4.6) of Chapter 3, we have 

P{N=j} = (l-a)aJ 0-0,1,...). (5.17) 

Now 

P{T>t}= 2 P{T>t\N=j}P{N=j] (5.18) 

and 

00 

P{W>t}= 2 P{W>t\N=j+\}P{N^j+\). (5.19) 

Also, 

P{W>t\W>0}= l)™^ =\p{W>t); (5.20) 
therefore, it follows from (5.19) that (5.20) can be written 

P{W>t\W>0} = - 2 P{W>t\N-j+l}P{N-j + \). (5.21) 

« j-0 

Substitution of (5.17) into (5.18) and (5.21) gives, respectively, 

P{T>t} = 2 P{T>t\N=j}{\-a)a j (5.22) 

and 

P{W>t\W>0}= 2 i > {W^>^=7'+l}(l-«V. (5.23) 

7-0 

But 

P{T>t\N=j} = P{W>t\N=j+l}, (5.24) 

because when N—j, then r is a sum of y+1 service times; and when 
N=j+ 1, then W is a sum of j+ 1 service times. Therefore, the right-hand 
sides of (5.22) and (5.23) are the same; hence (5.7) is true, as asserted. 

Exercise 

3. Solve Exercise 18 of Chapter 2 by evaluating the integral 

F It {y)^jF Ri {y-x)dF At {x). 
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5.6. Laplace-Stieltjes Transforms 

The ordinary Laplace transform should be familiar to every student of 
engineering and applied mathematics. In this section we outline the prop- 
erties of a generalization, called the Laplace-Stieltjes transform, which has 
proven to be very useful in applied probability and queueing theory. For 
an introduction to the theory of transforms, see Widder [1971]; and for a 
comprehensive discussion in the context of probability theory, see Chapter 
XIII of Feller [1971]. 

The Laplace-Stieltjes transform <^s) of a distribution function F(i) of a 
nonnegative random variable [that is, F(t)=0 when f <0] is defined as 

<j>(s) = f X e- s, dF{i) (s>0). (6.1) 

Note that the lower limit of the integral (6.1) is defined so as to include the 
jump at the origin when F(G)>0. For convenience, throughout the re- 
mainder of the text we shall adopt the convention that / is to be 
interpreted as / _ ; that is, we shall always assume, unless stated otherwise, 
that any jump at the origin is included. Thus, for example, from now on we 
shall write (6.1) as <)>(s) = J>~ sf dF(t). Observe that <j>(o)= 1. 

It turns out that the Laplace-Stieltjes transform of the distribution 
function of an arbitrary nonnegative random variable shares many of the 
properties of the probability-generating function of an arbitrary nonnega- 
tive integer-valued random variable, and this fact accounts for its useful- 
ness in queueing theory. [The close analogy between the Laplace-Stieltjes 
transform <f>(s) and the probability-generating function g(z) is illuminated 
by Exercise 4, where the reader is asked to show that <}>(s) differs from g(z) 
only by the change of variable z = e~ s ; that is, <i>(s) = g(e~ s ).] 

We first observe that if F(t) is differentiate, F\i)=f(t), then <f>(j) is the 
ordinary Laplace transform of /(f): 

<t>(s) = re-«dF(t)= r e -y(t)dt. (6.2) 

That is, the Laplace transform of a density function equals the Laplace- 
Stieltjes transform of the corresponding distribution function. (We remark 
that the names Laplace-Stieltjes transform and Laplace transform are not 
used universally as we use them here.) 

Like the ordinary Laplace transform (and the probability-generating 
function), the Laplace-Stieltjes transform maps the object function, of a 
variable /, into another function, of a dummy variable s. In order for any 
transform to be useful, of course, its properties must be such that (1) 
manipulations in transform space are easier than in the original space, and 
(2) the answer in transform space determines uniquely the answer in the 
original space. We will now list some properties of the Laplace-Stieltjes 
transform that demonstrate requirement (1); but we merely assert that 
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requirement (2) is met [if <j>(s) is known for all s > 0] and refer the reader to 
Feller [1971] and Widder [1971] for discussion of the uniqueness of the 
inverse transformation. Jagerman [1978] discusses numerical inversion of 
the Laplace transform, and illustrates his method with examples drawn 
from queueing theory. 

1. The moments of the distribution function F(t) can be obtained 
directly from its Laplace-Stieltjes transform by differentiation: 

/>*W-<-l)-(£«4_ o . (") 

Equation (6.3) is easily verified by formal differentiation of (6.1). 

2. If F x (i) and F 2 (t) have Laplace-Stieltjes transforms <j>i(s) and 4> 2 (s), 
respectively, and F(t) is the convolution 

F(t)= [ t F l (t~x)dF 2 (x)= f t F 2 (t-x)dF i (x), (6.4) 

then the Laplace-Stieltjes transform <j>(s) of F{t) is the product 

tfjWifctoifr). (6.5) 

Thus, if X x and X 2 are independent, nonnegative random variables whose 
distribution functions have Laplace-Stieltjes transforms <f> x (s) and <$> 2 (s\ 
then the distribution function of the sum X x + X 2 has Laplace-Stieltjes 
transform <(> 1 (s)<p 2 (s). To prove this, observe that the transform (6. 1) can be 
written as the expectation $(s) = E(e~ sX ), where F(t) is the distribution 
function of X. Hence, if X = X l +X 2 , then <^(s) = = 
E(e- sX *e- sX2 ) = E(e- sX ^E(e~ sX3 ) = ^(s)^ 2 (s), where the next to last 
equality follows from the independence of X x and X 2 . 

3. Of particular interest in queueing theory is the relationship between 
the generating function of the number of events that occur according to a 
Poisson process during a time interval whose length is a random variable 
and the Laplace-Stieltjes transform of the distribution function of the 
interval length. Thus, suppose that events occur according to a Poisson 
process with rate A, and let Pj be the probability that exactly j events occur 
during a time interval whose length is a random variable with distribution 
function F(t). Then 

Prr^T-e-^dFit) = 0,1,...). (6.6) 
-'o J- 

If the distribution {Pj} has the probability-generating function g(z) = 
'EjLqPjZ', then it follows from (6.6), after interchanging the order of 
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summation and integration, that 



«M-ri^«-vrfFw. (6.7) 

•'o y_o J- 

Observe that the integrand in (6.7) is the probability-generating function of 
a Poisson variable; hence, from Equation (4.3) of Chapter 2, Equation (6.7) 
can be written 

g(z) = f e-< K -**>'dF(t). (6.8) 
•'o 

Now, if F(t) has Laplace-Stieltjes transform <f>(s), 

<*>(*) = r e - s 'dF(t), (6.9) 

then it follows from comparison of (6.8) and (6.9) that 

g(z) = <p(\-\z). (6.10) 
Equation (6.10) will prove to be a most useful result. 



Exercises 

4. Let N be a nonnegative, integer-valued random variable with distribution 
function F(t). If $( s ) * s the Laplace-Stieltjes transform of F(t) [4>(s) = 
jQe~ st dF(t)] and g(z) is the probability-generating function of N [g(z) = 
^f. P{N=j}zJl show that j(s)~g(e-'). 

5. Consider again the premise of Exercise 4 of Chapter 2, but assume now that 
the distribution function of X, r (/' = 1,2,...) has Laplace-Stieltjes transform tft(s) 
(that is, Xf is no longer required to be discrete). Show that, in analogy with part 
a, S N has Laplace-Stieltjes transform g(<j>(s)), and that part b remains correct. 

6. We shall show in Section 5.8 that the equilibrium waiting-time distribution 
function W(i) in the M/G/\ queue with service in order of arrival has 
Laplace-Stieltjes transform co(j) = f™e~ st dW(t) given by 

w <» ln" P / m > 0) 

j-A[1-tj(j)] 

where vis) — <#/(/) is the Laplace-Stieltjes transform of the service-time 

distribution function H(t), with mean t = ffidH{t\ and p=Ar<l, where A is 
the arrival rate. 

a. Use the fact that the waiting time W has mean value E{ W) given by 
E(W)= -w'(0) to obtain Equation (4.2), which we restate here for con- 
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venience: 



E{W) = 



2(1 -p) 




(2) 



where a 2 is the service-time variance, 
b. We know that when service times are exponentially distributed with mean 
then the waiting-time distribution function W(t) is given by [see, for 
example, Equation (5.10) with a = p] 



Calculate u(s) from (3) and show that it agrees with (1) when H(t) =\ — e 
(/>0). Conclude that (3) is the inverse of (1) when service times are 
exponential. 

The Laplace-Stieltjes transform (1) of the equilibrium waiting-time distribution 
function in the M/G/l queue with order-of -arrival service is known as the 
Pollaczek-Khintchine formula. The corresponding result (2) for the mean wait- 
ing time (which is valid for any nonbiased queue discipline) is also often 
referred to by the same name. 

5.7. Some Results from Renewal Theory 

During our discussion of the Poisson process in Section 2.5 the following 
phenomenon was noted: If the mean time between adjacent arrival epochs 
in a Poisson process is A -1 , then an outside observer who samples 
(observes) at a random instant will see that the forward recurrence time 
(the elapsed time between his sampling point and the arrival epoch of the 
next customer) also has mean A -1 , instead of ^A -1 as one might naively 
expect. That this mean value exceeds was interpreted as a reflection 
of the fact that the observer would be more likely to sample during a long 
interarrival interval than during a short one. In the particular case of a 
Poisson process, where the interarrival times are exponentially distributed, 
we obtained this result about the forward recurrence time directly from 
consideration of the Markov property of the exponential distribution. In 
the more general case where the distribution of interarrival times is not 
restricted to the exponential, again an observer would tend to sample 
during a long interarrival interval, but more sophisticated mathematical 
tools are necessary for its analysis. In this section we review some results 
from renewal theory that are applicable to the analysis of this and similar 
questions for processes in which the interevent times are independent, 
identically distributed, positive random variables, with an otherwise arbi- 
trary distribution function. Recommended references on renewal theory 
are Cohen [1969], Feller [1971], Neuts [1973], and Ross [1970]. We note 
that a rigorous treatment of the subject of this section requires some 




C<0), 
(r>0). 



(3) 
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delicate arguments concerning limits that are outside the scope and spirit 
of this text. The reader who requires a more precise treatment of this 
subject should consult the references. 

Suppose that X u X 2 ,... is a sequence of independent, identically distrib- 
uted, positive random variables, with distribution function G(x)~ P{X j < 
x} and mean value fl = foXdG(x). The sequence {X,} is said to be an 
ordinary renewal sequence. In a common queueing-theory application, the X/s 
are interarrival times, and the corresponding arrival process is called renewal 
input or recurrent input. 

Suppose we have a renewal process in which events or renewals 
(customer arrivals, say) occur at epochs T u r 2 ,...,with X i =T i -T i _ x 
(/= 1,2,...; r =0). Let N(t) be the number of renewals (arrivals) in (0,/], 
and let S„ = X l + -- +X n =T n . Then the events [N(t)>n] and {S„<t} 
are equivalent; hence 



Of particular interest in renewal theory is the renewal function m(f), defined 
as the mean number of renewals in (0,/]: 



The elementary renewal theorem states the highly intuitive fact that the 
renewal function m(t) satisfies 



P{N(t)>n} = P{S n <t}. 



(7.1) 



(7.2) 



lim 



m(t) _ 1 . 



(7.3) 



t p* 



that is, in the language of asymptotic analysis, the mean number of 
renewals in (0,t] is asymptotic to t//3: 



m(t)~ 



t 



(7.4) 



fi' 



We shall also need the fact that 



m{ t )= J P[Sj<t). 



(7.5) 



7-1 



To prove (7.5), note that 



I P{Sj<t}= %P{N(t)>j}- f % P{N{t)-k], (7.6) 
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where the first equality in (7.6) follows from (7.1). Rearrangement of the 
terms in the right-hand sum of (7.6) yields 

2 2 P{N(t) = k} = 2 iP{ N(t) = 1} = E(N(t)), (7.7) 
y-i k-j j-i 

which establishes (7.5). 

Now let Pj{t) be the probability that exactly j renewals occur in the 
interval (T k , T k + t]. Note that the point T k is either a renewal point (k > 1) 
or the starting point of the process (k = 0), and that T k is not included in 
the interval. In particular, Pj(t) is the probability of j renewals in (0, t]. Let 
G* J (t) be the >fold convolution of G with itself, and define G*°(t)= 1 and 
G* l (t)=G(t). Then, since 

G*J(t) = P{Sj<t] 

and 

P{N(t)=j}-P{ N(t) >j) - P{N(t) >j + 1 }, 
it follows from (7.1) that 

Pj{t) = G*J{t)-G* +x \t) (j = 0, 1,...). (7.8) 

For example, if the interarrival times are exponentially distributed, 
G(t)=\ — then G* J (t) is the y-phase Erlangian distribution function, 

i-0 l - 

and (7.8) gives, of course, the Poisson probabilities, 

PAt)=Qfe-*< = 0,1,...). 

Our main result of this section concerns the limiting distribution of the 
forward recurrence time R t , defined as the elapsed time from an arbitrary 
instant t to the next renewal epoch. If we define 

lim P{R t *kx} = F{x), 

then we shall show that 



F(*)=i/ o X [l-G(0]^. 



(7.9) 
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Thus, (7.9) gives the distribution of the remaining time from an arbitrary 
observation point t to the next renewal epoch in an equilibrium renewal 
process (one that has been in existence infinitely long at time 

Exercise 

7. Show that if G(&-1 e~ X£ in (7.9), then F{x)—\ e'**; that is, if the 
interevent times are exponentially distributed, the forward recurrence times 
have the same distribution as the interevent times. 

Before proceeding to a proof of (7.9), let us consider some of its 
ramifications. The distribution function F(x) defined by (7.9) has 
Laplace-Stieltjes transform <K- S ) = Io e ~ sx dF(x) given by 

^-^^-T^* (710) 

where y(s) = /*e~ JJC dG(x) is the Laplace-Stieltjes transform of the distribu- 
tion function of times between renewals. To prove (7.10), note that from 
(7.9) we can write 

<t>(s)= r e - sx dF(x)=±- Ce-* x [\-G{x)}dx 

= -/ dG(y)dx, (7.11) 

where we have used the fact that 1 — (?(*) — f™dG(y). If we interchange the 
order of integration on the right-hand side of (7.11), we have 

= ^rr e ~ 3X dxdG{yl (7. 12) 

and (7.10) follows from straightforward evaluation of (7.12). 

We leave it for the reader to calculate the mean ($* = f^xdF(x) of the 
equilibrium forward recurrence time from the transform (7.10): /?* = 
-$'(0), that is, 

r-h£> ( ? - 13 > 

where a 2 is the variance of the interevent times. 
Exercise 

8. Verify Equation (7.13). 



Equation (7,13) shows, for example, that the mean equilibrium forward 
recurrence time is one-half of the mean interevent time (as the naive 
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+ 1 



t + x 



Figure 5.1. 



person might guess) if and only if a 2 — 0, that is, when the interevent times 
are constant. Likewise, it follows from (7.13) that if the interevent times are 
exponential (that is, if a 2 -B 2 ), then B* = B, as required by the Markov 
property of the exponential distribution. 

To prove (7.9), note that the recurrence time R t will not exceed the value 
jc if and only if at least one renewal occurs between t and t + x. Let j be 
the value of the index of the last renewal to occur prior to epoch t + x. 
Then Rf<x if and only if for some value of the index j (7 = 1,2,...) we 
have t<Tj<t + x\ and since 7} is the time of the last renewal prior to t + x, 
then it follows that t + x<T J+l . That is, as Figure 5.1. illustrates, R t <x if 
and only if the point t + x is bracketed by two successive renewal epochs, 
the first of which lies between t and t + x: t<Tj <t + x<T j+v Therefore, 

P{R^x}= f f t+X [\-G(t + x-y)]dP{T j <y}. (7.14) 

If we interchange the order of summation and integration in (7.14), and 
use the fact that T J = X l + ■ - ■ +Xj= Sj, we can write 

P{R l <x} = f t t+x [l-G(t + x-y)]d^P{S j <y} ) j; (7.15) 

and it follows from (7.5) that (7.15) can be written 

P{R t <x}= f t + X [l-G(t + x-y)]dm(y). (7.16) 

It is a consequence of the key renewal theorem (see the references given at 
the beginning of this section) that 

Hm j t+ "[ 1 - G(t + * -y) ] dm(y) = j jf *[ 1 - <?(£) ] </& (7.17) 

and (7.9) is established. A heuristic argument for (7.17) asserts that as 
/->oo so does >>; therefore, it follows from (7.4) that one can replace m(y) 
on the left-hand side of (7.17) byy/B; that is, dm(y)~(\/ B)dy. Hence we 
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can write 

\™f' + \\-G(t + x-y)]dm(y) = ±f t+X [\-G(t + x-y)]dy. (7.18) 

Now, if we make the substitution t + x~y =£ in the right-hand integral of 
(7.18), wc obtain (7.17). 

Thus far we have concerned ourselves with the forward recurrence time 
R r which is the elapsed time from / until the next renewal. We now define 
the covering interval I, as the interval defined by the renewal epochs that 
bracket the point /, and the age A t of the covering interval at time t as the 
elapsed time between / and the preceding renewal epoch. The age is 
referred to also as the backward recurrence time. Clearly, I t - A t + R t \ 
however, the random variables A t and R t are not, in general, independent. 

In Exercise 9 the reader is asked to verify that 

lim dP{I t <x} = ^xdG(x). (7.19) 

t—*ao p 

L. Takacs has given the following intuitive interpretation of the right-hand 
side of (7.19), which is the "density" function of the length of the 
interval that covers an arbitrary point / in an equilibrium renewal process: 
dG(x) is the frequency with which intervals of length x occur in the general 
population of intervals defined by our renewal process. We imagine that 
an observer chooses an interval by observing the process at time t and 
selecting the interval that contains the observation point t. We have 
already argued that an observer sampling without regard to the underlying 
process is more likely to sample during a long interval than a short one; 
that is, the point / is more likely to be contained in a long interval than a 
short one. If the frequency dG(x), say, with which such an interval has 
length x,is proportional to the product of the frequency of occurrence of 
intervals of length x in the general population and the length x, then 
dG(x) = cxdG(x), where c is a constant. If we require that j^dGix)— 1, 
then c = p- 1 and (7.19) follows. 

Other interesting facts, which the reader is asked to verify in Exercise 9, 
are 

}imP{R t <x,I t <y} = ±fJ[G(y)-G(&]d£ (0<x<y), (7.20) 

lim P{R t <x\I t =y} = - (0<x<y), (7.21) 

/-»<» y 

^ F { Rt>x ' A > >y } = \r l l ~ G (0]dl (7.22) 
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Equation (7.20) gives, for an equilibrium renewal process, the joint distrib- 
ution function of the forward recurrence time and the length of the 
covering interval. Equation (7.21) shows that the location of the point t is 
uniformly distributed over the length of the covering interval. And (7.22) 
shows easily (by virtue of its symmetry in x and y) that, in an equilibrium 
renewal process, the forward recurrence time and the backward recurrence 
time are identically distributed. 

Exercises 

9. a. Show that 

P{R t <x,I t <y} = 2 f'~ y + X [G(y)-G(t~0]dP{T J <^ 

+ 2 f [G(t-t+x)-G(t-0]dP{Tj<Z} 
(0<x <>></)• 

[Hint: Let 7} be the time of occurrence of the last renewal prior to time t.) 

b. Using the same kind of heuristic argument by which (7.9) was derived from 
(7.14), show that 

lim P{R t <x,I t <y} = l f y ~ X [G(t + x)-G(OW 

+ [G(y)-G(mdt 

and show that this implies (7.20). 

c. Show that the distribution function of the length of the covering interval in 
an equilibrium renewal process, obtained by setting x —y in (7.20), is given 
by 

Urn P{I t<y }=* ( y ZdG(i\ 

t-KX> P J 

and thereby deduce (7.19). 

d. Prove (7.21). 

e. Show that (7.20) and (7.22) are equivalent. 

f. Use (7.22) to show that if the equilibrium renewal process is a Poisson 
process, then the forward recurrence time and the backward recurrence time 
are independent. 

10. Customers arrive at a single server according to a renewal process with 
interarrival-time distribution function G(t). The service-time distribution 
function is H{t), and blocked customers are cleared. 

a. Define the cycle time as the time interval separating two successive epochs 
at which the server becomes busy, and let F(x) be the cycle-time distribu- 
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tion function. Show that 

*X*)-2 (* ( X [\-G(x-y)]dG*J(y)dH(t). (1) 

[Hint: Define R t as the duration of time from a service completion epoch 
(at time /) to the next arrival epoch, so that F(x)- foP{R, <x-t}dH(t).] 

In parts b-h, assume H(t)= 1 - e"^ (t > 0). 

b. Let <f>(s) = J™e~'*dF(x) be the Laplace-Stieltjes transform of F(x), and 
show that 

where y(s) — foe~ st dG(t) is the Laplace-Stieltjes transform of G(t), and 
G(0)=0. 

c Let a denote the mean cycle time, and show that 

«- -^(0) , (3) 

d. Let P be the equilibrium probability that the server is busy, and show that 

J , -[-y'(0)mJ" 1 [i-y(m)]. (4) 

e. Let JT be the equilibrium probability that an arbitrary arrival finds the 
server busy, and show that 

17= -K M ). (5) 

f. Equations (4) and (5) imply 

i>-[-Y'(o)j*r ! (i-n). (6) 

Interpret this result in terms of offered and carried loads. 

g. Show that if G(t) = 1 - e (t > 0), then 

i. P=iT; 

ii. the cycle time is the sum of two independent exponential random 
variables, with parameters X and j», respectively. 

h. Argue from first principles that if the interarrival times are of constant 
length t, then 

F(x) = l-e-* T [jr<x<(j+\)r; y-0,1,2,...]. (7) 

Calculate 4>(.s)*= f^e - ** dF(x) from (7), and show that this agrees with (2). 

11. Customers arrive according to a renewal process at a group (finite or infinite) 
of servers with exponential service times. The servers are numbered 
1,2,..., and each arriving customer takes the lowest-numbered idle server 
(ordered hunt). Thus, the input stream at the (/+ l)th server is the overflow 
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stream from the ith server. (If the number of servers is finite, then blocked 
customers are cleared.) 

a. Show that if the arrival stream at a server is a renewal process, then the 
overflow stream from that server is also a renewal process. 

b. Let (7,(0 be the distribution function of times between successive overflows 
from the ith server, and show that 

W)- fV^ + O-e-^C'-*)]^-^) = 1,2,...), (1) 
•'o 

where G (t) is the distribution function of interarrivai times at the first 
server, and n~ 1 is the mean service time. 

c. If Y/Cs) is the Laplace-Stieltjes transform of G ( (t), show that 

( \ Y/-1O + /*) /• i t \ /<j\ 



Equation (2) was first derived by Palm [1943], and (1) by Takacs [1959], who 
extended Palm's work. £inlar and Disney [1967] have studied the more 
general model in which there are n waiting positions; they obtain (2) when 
n — Q. Descloux [1970] has shown how systematic application of renewal 
theory can further simplify and extend these results. 



5.8. The M/G/l Queue 

We have now developed sufficient mathematical machinery to derive 
easily many results for the equilibrium M/G/l queue and some of its 
variants, which may well be the most widely studied class of queueing 
models. 

A precise description of the M/G/l queueing model is this: (1) 
Customers arrive according to a Poisson process. (2) There is a single 
server. (3) An arriving customer who finds the server idle begins service 
immediately. (4) All blocked customers wait until served, (5) The server 
cannot be idle when there is a waiting customer. (6) Service times are 
identically distributed, nonnegative random variables, independent of the 
arrival process and each other. 

We shall denote the arrival rate by X and the service-time distribution 
function by H{t), with mean t = f^t dH(t). Also, we define the traffic 
intensity (or offered load) p as the product p=At. We shall assume 
throughout that p<l; that is, the arrival rate is less than the service 
completion rate. (A consequence of this assumption is that the system will, 
for all practical purposes, attain statistical equilibrium after a sufficiently 
long time.) Our objective is to obtain results that describe the statistical 
characteristics of the M/G/l queue in equilibrium. 
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The Mean Waiting Time 

We begin with a direct derivation of the famous Pollaczek-Khintchine 
formula for the expected value of the waiting time W experienced by an 
arbitrary customer in the equilibrium M / G/l queue, 

where a 2 is the variance of the service-time distribution function. Equation 
(8. 1) is simply a rewriting of Equation (4.2), which was derived by a rather 
indirect method, and its ramifications discussed, in Section 5.4. It is the 
purpose of the present derivation to show that one can "explain" the 
presence of the mysterious and important variance term in the light of 
renewal theory. 

We begin by expressing the expected waiting time E{ W) for an arbitrary 
customer (the test customer) in terms of the conditional expected waiting 
time, conditioned on whether or not the test customer is blocked: 

E{ W) - E{ W\ W > 0)P { W > 0} + E{ W\ W= 0)P { W =0} . (8.2) 

The second term in (8.2) is obviously zero. We have argued previously (in 
Section 5.4) that 1 — II = p, where JT is the equilibrium probability that an 
arriving customer finds the system empty. Hence P {W >0) —p, and 
Equation (8.2) reduces to 

E(W)= P E(W\W>Q). (8.3) 

Now, conditional on the event {W>0} that the test customer must wait 
for service, the waiting time is given by 

W=X X +--- +X Q + R (W>0), (8.4) 

where Q is the number of customers (if any) waiting in the queue when the 
test customer arrives, given that W>0; X t is the service time of the /th 
customer in the queue; and R is the remaining service time of the customer 
in service when the test customer arrives. 

By assumption 2?(JQ = r, and the X t 's are independent of Q. Therefore, 

E{X x +-> + X q \W>0) = tE(Q\W>0). (8.5) 

Hence, if we take expected values in Equation (8.4), we get 



E(W\W>0) = tE(Q\W>0) + E(R\W>0). 



(8.6) 
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Now, by the same argument used to derive (8.3), we can also establish 

E(Q) = P E(Q\W>0). (8.7) 

Applying the equation L=\W as in Section 5.4, we know that E(Q) = 
\E(W). Hence, it follows from (8.7) that 

tE{Q\W>0) = E{W). (8.8) 

Therefore, (8.6) can be written, interestingly, as 

E( W\ W > 0) - E( W) + E(R ), (8.9) 
where, of course, E(R) = E(R \ W>0); and thus (8.3) becomes 

E{ W)= j^--E(R). (8.10) 

To evaluate E(R\ we note that the renewal-theory argument of Section 
5.7 is applicable. That is, the sequence of service times forms a renewal 
process (ignoring any idle times between service times), and since the 
system is assumed to be in equilibrium, R is the equilibrium forward 
recurrence time from an arbitrary point (the arrival epoch of the test 
customer) to the next renewal (end of the current service time). Thus 
Equation (7.13) is applicable; that is, E(R) = (3* with /? = t: 

<«•») 

(For a rigorous justification of this argument, see Exercise 14.) Substitution 
of (8.11) into (8.10) gives the Pollaczek-Khintchine formula (8.1); and the 
variance term is seen to reflect the fact that the test customer tends to 
arrive during a service time that is longer, on the average, than an arbitrary 
service time. 



The Imbedded Markov Chain 

We shall now analyze the equilibrium M/G/l queue in more depth, using 
the concept of the imbedded Markov chain, an idea of great importance in 
queueing theory, which was introduced by Kendall [1951, 1953]. In par- 
ticular, we shall derive the following expression for the probability- 
generating function g(z) of the number of customers left behind by an 
arbitrary departing customer in an equilibrium M/G/l system: 
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where 17(5) is the Laplace-Stieltjes transform of the service-time distribution 
function. 

In Chapter 2 we saw that an important characteristic of a birth-and- 
death process is that if at any instant t the system state E. is known, then in 
principle the state probabilities for all t + x (x> 0) are also known, inde- 
pendent of whatever route the system may have followed in reaching state 
E i at time t. When this is true, we may write equations relating the state 
probabilities at any time t to those at any other time t + x. We showed that 
a necessary and sufficient condition for a system to have this property, 
called the Markov property, is that the time between successive changes of 
state be exponentially (but not necessarily identically) distributed. A pro- 
cess that has this characteristic Markov property is called (naturally) a 
Markov process. An example is the number of customers in the M/M/s 
(Erlang delay) system, which was discussed in Chapter 3. 

Let us consider the birth-and-death process again, and see why the 
M/G/l queue, unlike the M/M/s queue, cannot be modeled as a 
birth-and-death process. Let N(t) be the number of customers in the 
system at time t. Then, by the theorem of total probability, 

00 

P{N(t + x)-j}- 2 P{N(t + x)-J\N(t)~i)P{N(t)-i) 
i-0 

(t>Q, x>0; / = 0,1,...). 

Note that this set of equations is valid for any queueing model. All that is 
required is to evaluate the conditional probabilities P{N(t + x)=>J\N(t) = 
/}, called the transition probabilities, and solve these equations for the state 
probabilities P { N(t) =7} (J = 0, 1, . . . ). 

In the analysis of those queueing models that are birth-and-death 
processes, we wisely chose x small and let jc-»0. Since all the transition 
probabilities except those representing a transition of step size or ± 1 are 
o(x) as x—>0, we were saved from laboriously calculating the transition 
probabilities, only to see most of them become irrelevant as jc-»0. But the 
fact remains that in principle we could have calculated the transition 
probabilities first, simply because the birth-and-death process is a Markov 
process; that is, the time separating successive transitions is exponentially 
distributed, obviating the need to consider how long the system had been 
in state is, at time t, or by what path state E t at time / evolved. 

In the M/G/l queue this is not true; since the service-time distribution 
function is not assumed to be exponential, in order to specify the transition 
probabilities one must know not only what state the system is in at time 
but also how long it has been there. Thus, the above equations are not 
incorrect; they are just inapplicable to the analysis of the M/G/l queue. 

A renewal point of a stochastic process with time parameter * is a time 
epoch at which the Markov property holds. Thus, if / is a renewal point for 
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a given process, then the future evolution of this process depends only on 
the state (appropriately defined) at time t. A Markov chain is a stochastic 
process with a discrete state space, in which every point t in the parameter 
space is a renewal point. 

With regard to the M/G/l queue, the service completion points, which 
are those epochs at which customers complete service and leave the 
system, are renewal points. For whenever a customer leaves the system, 
either the system becomes empty or a previously waiting customer starts 
service; in either case, because of the lack of memory in the (Poisson) 
arrival process, the transition probabilities (and therefore the future evolu- 
tion of the system) depend only on the state, defined here as the number of 
customers in the system at the service completion point. Hence, this 
particular choice of time epochs (the service completion points) and 
definition of state (the number of customers present) permits the applica- 
tion of the (well-developed) theory of Markov chains. This kind of Markov 
chain, whose states are defined at a discrete set of renewal points that are 
imbedded in the continuum of all nonnegative points, is called (naturally) 
an imbedded Markov chain. Following Kendall, we now proceed to use the 
concept of the imbedded Markov chain to study the M/G/l queue in 
equilibrium. 

Before beginning the analysis, we note that the theory of Markov chains 
is widely known, and therefore we feel no need to treat it in this text. Also, 
the only theorem we shall need from the theory of Markov chains is 
intuitively reasonable, which further obviates the need for discussion here. 
A more formal definition of Markov chain is given in this text in Exercise 
9 of Chapter 3. For further discussion, see Cohen [1969], Cinlar [1975], 
Feller [1968], Karlin [1968], Neuts [1973], Ross [1970], or one of the many 
other books that treat this topic. 

Proceeding with the analysis, we define N£ to be the number of 
customers in the system (including any in service, but excluding the 
departing customer) at the instant the A:th customer completes service. 
That is, if T x , T 2 ,... are the successive service completion points, then N% is 
the state of the system at T k + 0. Then, by the theorem of total probability 
we can write 

P{W + i=J}= 2 P{^ +l =j\N* = i}P{N* = i] 

= 0,1,.,.; *=1,2,...). (8.13) 

Consider first the transition probability P{N£ +l =J[N£ = 0). If the kth 
customer leaves the system empty, then the (k+ l)th customer necessarily 
finds the server idle. Hence the number of customers left behind by this 
customer will be all those who will have arrived during his service time. 
Since with probability e'^QJtf /j\ exactly j customers will arrive during 
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any interval of length |, we have 

*W + i-vW-0}- rS&e-*dH(& (j>0), (8.14) 

-'o J- 

where //(£) is the service-time distribution function. Similarly, when JV£ = / 
>0, the (A:+l)th customer will leave behind those same customers left 
behind by the &th customer except for himself, plus all those customers 
who will have arrived during the service time of the (&+l)th customer. 
Therefore 



i-i+i 



(i>0, j>i- 1) (8.15) 

and, of course, 

^{^ + i=K = '}=0 ('"><>, y</-i). (8.16) 



Let us define 



Pj-r^e-VdH® 0=0,1,...). (8.17) 
■'o J- 

Then the transition probabilities are 



Pj if i = 0, 

Pj-i+i if »>0 and J>i~l, (8.18) 
if />0 and j<i-\. 



Note that the transition probabilities are independent of the value of the 
index k, that is, are the same for every pair of successive customers. 

With the transition probabilities thus specified, we turn our attention to 
Equation (8.13), which relates the distribution of the number of customers 
left behind by the (k + l)th departing customer to the distribution of the 

number left behind by the klh departure, for each A: =1,2 From a 

practical point of view, we are rarely interested in this distribution for any 
particular value of k; rather we wish to know the distribution of the 
number of customers left behind by an arbitrary departing customer. 
Clearly, the distribution is different for each k, but it seems reasonable that 
the dependence on k should diminish as k—±cc. Intuitively, one would 
expect that lim A ._ >0O i > {A r *=y} = Ilf should exist under the appropriate 
conditions for the imbedded Markov chain, just as does \im t _ lt00 P{N(t) — 
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j) = Pj for the Markov (birth-and-death) process studied in Chapter 2. The 
distribution is thus a statistical-equilibrium distribution; II* is the 

probability that an arbitrary departing customer leaves behind j other 
customers in a system that has been operating for a sufficiently long period 
of time. 

From a mathematical point of view, it can be shown using the theory of 
Markov chains that a unique proper stationary distribution 

IIf= lim P{NZ=j} = 0,1,...), (8.19) 

k— >oo 

independent of the initial conditions, exists if and only if p<l. (If p> 1, 
then Ilf = for all finite j.) The proof of this statement is the only place 
where the formal theory of Markov chains is relevant. Since both the 
existence of the limiting distribution (8.19) and the requisite condition p< 1 
are obvious in the queueing-theory context, we shall accept this intuitively 
plausible statement without proof. (For a formal proof, see pp. 236-238 of 
Cohen [1969].) Equation (8.13) becomes 

y+i 

nr-Pjiis+^Pj-i+in* o-o,i,...). (8.20) 

i~ 1 

The normalization equation is 

2 n* = \. (8.21) 

We proceed to solve Equations (8.20) and (8.21) for the departing 
customer's distribution {II*}. 
Define the probability-generating function 

2 n*zK (8.22) 
y-o 

Substituting (8.20) into (8.22), we have 

00 oo 7 + 1 

2 pj^+ 2 2 Pj-^n*z J . (8.23) 

For simplicity let us denote by S the double sum in (8.23): 



00 J+l 

s= 2 2 />,-,+. 

y-o i-i 



(8.24) 



The M/G/l Queue 



215 



If we reverse the order of summation in (8.24), we have 

s= 2 n? 2 Pj-i+i**- 2 nr 2 /^* + '-\ (8.25) 



;_1 /--/-I i-1 Ar = 



where the second equality in (8.25) follows from the substitution k =j — i + 
1. Hence, we can write (8.24) as 

S = z~^ f 2 ( 826 > 

If we now define the probability-generating function 

Kz)- ^ Pj z J , (8.27) 

7-0 

then, in light of (8.22) and (8.27), (8.26) can be written 

S = z-'[g{z)~nt]h{z); (8.28) 
and (8.23) can be written 

g(z) = IlSh(z) + z ~ 1 [ g(z) - n* ] h(z). (8.29) 
Solving (8.29) for g(z) yields 

-W- 1 ^* (-0) 

it remains only to find 11$. The normalization equation (8.21) implies 

g(l) = l, (8.31) 
and (8.30) and (8.31) yield, with a single application of L'Hopital's rule, 

II*=\-p. (8.32) 

[We have previously argued that P(W >Q) — I — II — p; the equality II — 
J2J is consistent with the theorem of Section 5.3.] 

Finally, we note that if we define tj(s) as the Laplace-Stieltjes transform 
of the service-time distribution function, 

ilCO- re-*dH(®, (8.33) 
•'o 
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then substitution of (8.17) into (8.27) gives [see Equation (6.10)] 



h(z) = 7i(\-Xz). 



(8.34) 



And substitution of (8.32) and (8.34) into (8.30) yields (8.12), as promised. 

Thus we have derived an expression (8.12) for the probability-generating 
function (8.22) of the number of customers left behind by an arbitrary 
departure in an M/G/l system in equilibrium. It follows from the theo- 
rem of Section 5.3 that (8.12) also generates the corresponding probabili- 
ties from the viewpoint of an arbitrary arrival; and it follows from the 
assumption of Poisson input that these are also the outside observer's 
probabilities: 



Wc emphasize that, clearly, the distribution generated by (8.12) depends 
on the particular choice of the service-time distribution function; neverthe- 
less, as (8.32) shows, the probability II$ — n o =P does not. Thus, the 
probability that an arrival will find the server busy does not depend on the 
service-time distribution function; but if the arrival does find the server 
busy, then the distribution of the number of other customers he finds 
waiting in the queue does depend on the distribution of service times. 

The mean number of customers present in the equilibrium M/G/l 
system (at a departure point, an arrival point, or an arbitrary point) can be 
found from (8.12) by a routine but tedious calculation (two applications of 
L'HopitaPs rule are required). The result, which we derived previously [see 
(4.19)J by a different argument, is 



where a 2 is the variance of the service-time distribution function. (The 
reader is referred back to Exercise 24 of Chapter 2, where he was asked to 
carry out these calculations.) Equation (8.36) shows, for example, that the 
mean number of customers waiting in the queue, [which equals g'(X) — p, 
the mean number in the system minus the mean number in service] is twice 
as great in the M/M/l queue (where o 2 = t 2 ) as in the M/D/l queue 
(where a 2 = 0), even though the probability that there is a queue at all is the 
same in both cases. 

The Pollaczek-Khintchine Formula 

We now turn to the waiting-time distribution function for the equilibrium 
M/G/l queue when service is in order of arrival. Let W be the waiting 
time of an arbitrary customer, with distribution function W{t)-P{ W<i) 



n*=nj=Pj 0=0,1,2,...). 



(8.35) 




(8.36) 
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and Laplace-Stieltjes transform 

o>(s)= Ce- st dW{t). (8.37) 
•'o 

We shall show that 

*>- ,-M' : &)r (8 ' 38) 

The important result (8.38) is widely known as the Pollaczek-Khintctune 
formula. Using the fact that E{W)= — «'(0)> it is a straightforward but 
tedious task (requiring two applications of L'Hopital's rule) to establish 
that 



a result that is also often referred to as the Pollaczek-Khintchine formula. 
[Equation (8.39) has been derived earlier in this chapter by two indepen- 
dent arguments — see (4.2) and (8.1) — and its ramifications discussed in 
Section 5.4. The reader is also referred back to Exercise 6, where he was 
asked to verify (8.39) by calculating — «'(0), and to invert the transform 
(8.38) when the service times are exponential.] Finally, we shall invert 
(8.38) to obtain the remarkable formula, first found by Benes" [1957], 

W(/)-(l-p)2 ^"{t), (8.40) 

j-0 

where H* J {t) is the j-i old self -convolution (with H* l = H and H*°= 1) of 
the distribution function H(t) defined by 

£(,)=^jT'[ l-H{Z)]dt (8.41) 

We emphasize that Equations (8.38) and (8.40) require that the queue 
discipline be service in order of arrival, but (8.39) remains valid for any 
nonbiased queue discipline. 

To obtain (8.38), let <p(s) be the Laplace-Stieltjes transform of the 
distribution function of the sojourn time (the sum of the waiting time and 
the service time) of an arbitrary customer (the test customer). Since the 
waiting time and the service time of any customer are independent random 
variables, the Laplace-Stieltjes transform of the distribution function of 
their sum is the product of the Laplace-Stieltjes transforms of the compo- 
nent distribution functions; that is, 



<#>(j) = «(,y)»f(j). 



(8.42) 
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Now, <J>0) is the Laplace-Stieltjes transform of the distribution function 
of the total length of time (sojourn time) the test customer spends in the 
system. The distribution of the number of customers left behind by the test 
customer has probability-generating function g(z) given by (8.12). Since 
service is in arrival order, the customers left behind by the test customer 
must all have arrived during his sojourn time. And since the arrivals follow 
a Poisson process with rate A, the generating function of the distribution of 
the number of arrivals during this sojourn time is related to the Laplace- 
Stieltjes transform of the distribution function of the length of the sojourn 
time according to Equation (6.10): 

ir(z) = </>(A-Az). (8.43) 

Equations (8.42) and (8.43) imply g(z)=td(A— Xz)tj(\ — Az); when this ex- 
pression is used in the left-hand side of (8.12) and the factor ri(\ — Az) is 
canceled, we get 

The substitution s = \-\z in (8.44) yields (8.38). 
To derive (8,40), we rewrite (8.38) in the form 

<o(*) = (l-p) 1 



which expands to 



*)-o-P)i^^ 



(8.45) 



Referring back to Equation (7.10), we observe that the factor in parenthe- 
ses in (8.45) is the Laplace-Stieltjes transform, 7j(j) say, of the function 
H(t) defined by (8.41). That is, according to (7.9) and (7.10), if H(t) is 
given by (8.41) and ij(s) = j^€~ st dH(t), then 

^ ) = Ii_3(£). (8 .46) 

T S 

Therefore, Equation (8.45) can be written 

«W = (1-P)ip j '[^)] y - (8-47) 



Finally, term-by-term inversion of (8.47) yields (8.40), because if rj(s) is the 
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transform of H(j\ then ['nC s )} / is the transform of the 7-fold self -convolu- 
tion of H(t). 

The rather mysterious appearance of the function H(t) in (8.40) is 
noteworthy. One can interpret H(i) as the distribution function of the 
elapsed time from the arrival of a customer until the end of the service 
time of the customer (if any) in service when he arrives. In the case of 
exponential service times, H(t)=\-e~ ,a , then H(t) = H(t), and (8.40) 
yields easily to term-by-term interpretation: (1 — p)f^ is the probability that 
the test customer, who arrives at t , say, finds j customers present; and 
H* s {t) is the distribution function of the sum of the service times of the 
customers ahead of him (including the remaining service of the customer, 
if any, in service at * ). But, despite its simple form, (8.40) does not seem to 
yield to similar term-by-term interpretation in the general case. This 
provides a dramatic counterexample to the folk theorem (often true) that 
simple results have simple explanations. 

The Pollaczek-Khintchine formula (8.38) was first obtained by Pollaczek 
[1930] and, independently, by Khintchine [1932]. Interestingly, it was also 
obtained by Cramer [1930], in connection with a problem of insurance risk 
(to which we shall return shortly). 

The elegant derivation given here is due to Kendall [1951]. In Kendall's 
derivation, we write equations for the state probabilities from the 
viewpoint of the fcth customer, and let k-* 00 to yield the distribution {II*} 
of the number of customers left behind by an arbitrary departing 
customer, who leaves after the arrival and departure of an "infinite" 
number of other customers have brought the system into statistical 
equihbrium. Then the distribution function of the time waited by this 
departure is calculated. An analogous (but much more complicated) argu- 
ment that takes the viewpoint of an arriving customer is outlined in 
Exercise 14. 

Khintchine's [1932] method was to denote by W n the waiting time of the 
«th arriving customer and show that for p< 1, 

Urn P{W n <t} = W(t\ (8.48) 

n — >00 

where W(t) has the Laplace-Stieltjes transform (8.38). Pollaczek [1930] 
considered the model where n customers arrive in the interval (0,*), with 
the arrival times chosen independently from a distribution uniform over 
(0,x). He showed that the distribution function of the waiting time of an 
arbitrarily selected customer from this group tends to JV(t), with Laplace- 
Stieltjes transform (8.38), when p< 1 and h-»oo and x->oo in such a way 
that n/x—>\. 

Another approach is to define the virtual waiting time V y as the length of 
time that a customer would have to wait for service to commence if he 
were to arrive at time y. Takacs [1955, 1962a, 1967] has studied this in 
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detail and has shown, among other things, that for p<l, as one would 
expect, 

\im P{V y <t)=W(t\ (8.49) 

where W(t) is the distribution function whose Laplace-Stieltjes transform 
is given by (8.38). A heuristic proof of (8.49) is outlined in Exercise 15. 

Cohen [1969] has given a neat derivation of the Pollaczek-Khintchine 
formula by using the method of collective marks; this procedure is out- 
lined in Exercise 13. 

Finally, let us entertain a slight digression, namely, the insurance risk 
problem alluded to earlier, in connection with which Cramer [1930] de- 
rived a formula that, it turns out, is identical with the Pollaczek- 
Khintchine formula. Suppose that at time u — an insurance company has 
a reserve fund of amount t to cover claims made by its policyholders. We 
assume that the instants at which the claims are made follow a Poisson 
process with rate \, and that the amounts of the claims are mutuall> 
independent, identically distributed, nonnegative random variables with 
distribution function H(g); an d we denote by X(u) the total amount of 
claims made in (0,m). We further assume that the policyholders pay their 
premiums into the reserve fund linearly with time. Thus, if we denote by 
R(u) the size of the reserve fund at time u, we can write 

R(u) = t+ cu- X(u), (8.50) 

where c is the rate at which the policyholders increase the size of the 
reserve fund through payment of their premiums. We can, without loss of 
generality, define the monetary unit so that c = 1 ; then (8.50) can be 
written 

R(u) = t- [X(u)-u]. (8.51) 

The insurance company would like to know the probability p(t) that there 
will always be enough money in the reserve fund to cover the claims made 
against it, when the initial size of the reserve fund is t. Hence, we seek the 
probability p(t) that R(u) > for all u > 0, when R(Q) = /; that is, p(i) is the 
probability that the right-hand side of (8.51) will never become negative: 

p(t) = P{ sup [X(u)-u]<t\. (8.52) 

We now give an elegant argument, due to Takacs [1967, pp. 119-120], to 
show that the right-hand side of (8.52) and the left-hand side of (8.48) are 
equal; that is, the probability that an insurance company that starts with a 
reserve fund of amount / will never run out of money equals the probabil- 
ity that a customer will not wait a time in excess of / for service to 
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commence in an equilibrium M/G/l queue with service in order of 
arrival: 

p(t)=W(t). (8-53) 

Denote by 7] (i — 1,2, . . .) the time of the /th arrival (claim), and let X{u) 
be the sum of the service times (the total amount of claims) associated with 
all the arrivals in (0,«]. Then, clearly, 

sup [X(u)-u]= sup [*(7; + 0)-r,], (8.54) 

0<w<oo 0<i<oo 

where T =0. If we let X; (/ = 1,2,---) be the service time of the ith 

customer (the amount of the ith claim), then ^(7^ + 0)= X x -i +X ( and 

(8.54) can be written 

sup [X(u) ~ u] = sup (X x + • • • + X; - T { ), (8.55) 

0<K<00 0<l'<OO 

where the empty sum is taken to be zero. It follows from (8.52) and (8.55) 
that 

p(t) = p{ sup (X l + -- +X i -T i )<t). (8.56) 

We now show that 

lim P{ W n <t} = P\ sup (X x +--- +X i -T i )<t), (8.57) 

l 0</<oo J 

from which (8.53) follows [by comparison of (8.57) with (8.48) and (8.56)]. 
The random variables W n (it — 1,2,...) satisfy the recurrence relation 

W n + X = max(0, W H + X H -(T m+l -T m )) (« = 1 , 2, . . . ), (8.58) 

where W x =0 (because we assume that the server is idle at time w = 0). 
Expansion of (8.58) yields 

^ n+1 =max(o J J!r„-(7; +1 -7;) J jir w +x n _ 1 -(r n+1 -r ; ,_ 1 ) 3 ..., 

**+••• +X 2 -{T H + l -Tj,X H +-+X 1 -(T m+l -Tj). 

(8.59) 

Now, consider the right-hand side of (8.59) with the random variables 
X n> X n _ x ,...,X x replaced by X x ,X 2 ,...,X n , respectively, and (T n+X -T n ), 
(Tn+\-Tn-\)> ->(T»+\-T\) replaced by r 1 ,r a ,..„7 , ll> respectively; the 
result is a new random variable that has exactly the same distribution as 
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(8.59). Therefore, 

P{W n+x <t} = P{m^{Q,X x -T Xi X^X 2 -T 2 ,..., 

*!+••• + X n ^ l -T n _ l ,X l +--- +x n -T n )<t}. 

(8.60) 

Now let w-»oo in (8.60); the result is (8.57), and the proof of (8.53) is 
complete. Note that the assumption of Poisson input was not used in the 
proof; it follows that (8.53) remains correct for the case when the times at 
which the arrivals (claims) occur follow an arbitrary renewal process, 
where W(t) is now the equilibrium waiting-time distribution function for 
the GI/G/X queue with service in order of arrival. 

For more on insurance risk theory and its relationship to queueing 
theory, see Halmstad [1974], Seal [1969], and Takacs [1967]. 

Exercises 

12. The M/G/\ queue with server vacation times. Consider the equilibrium 
M/G/l queue with the following variation. The server works continuously as 
long as there is at least one customer in the system. When the server finishes 
serving a customer and finds the system empty, it goes away for a length of 
time called a vacation. At the end of the vacation the server returns and 
begins to serve those customers, if any, who have arrived during the vacation. 
If the server finds no customers waiting at the end of a vacation, it im- 
mediately takes another vacation, and continues in this manner until it finds 
at least one waiting customer upon return from a vacation. Let \ be the 
arrival rate, rj(s) be the Laplace-Stieltjes transform of the service-time distri- 
bution function, and 

/(*)= f Jt/V 

be the probability-generating function of the number of customers waiting for 
service at the end of a vacation. 

a. Let = °> !>•••) be the equilibrium probability that an arbitrary de- 
parting customer leaves behind j other customers, and define the probabil- 
ity-generating function 

y-o 

Write the imbedded-Markov-chain equations that determine the distribu- 
tion {II*}, and show that 



(1) 
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that is, 

where g(z) is given by (8.12). 
b. Show that g(z) = g(z) if and only if f(z)=*P(Q) + P(l)z. Explain this result, 
c Find the equilibrium probability-generating function YtTLjZ j of the number 

of customers in the system found by an arbitrary arrival. Show that 

ft n \ 1 - p <9> 



d. Show that 

r<iwo)+^§. (3) 

e. Show that if the vacation lengths are independent of the arrival process, 
then the Laplace-Stieltjes transform u(s) of the waiting-time distribution 
function when service is in order of arrival is given by 

f. If the vacation lengths are independent of the arrival process, find the 
probability that an arriving customer must wait for service. 

g. If W is the length of time an arbitrary customer waits for his service to 
commence, show that 

E(W)=E{W)+J^ y (5) 

where E(W) is given by (8.39). 
b. Suppose that /(z) = z y . Show that Equation (4) reduces to the Pollaczek- 

Khintchine formula when j — Is Equation (4) correct when j>21 Why? 

What about (2), (3), and (5)? 
For more on this model, see Cooper [1970] and Levy and Yechiali [1975], 



13. Derivation of the Pollaczek- Khintchine formula by the method of collective 
marks (Cohen [1969]). 

a. Let g k (z) be the probability-generating function of the number of 
customers left behind by the &th departing customer in the M/G/l queue 
with order-of-arrival service. Show that g k (z) can be interpreted as the 
probability that no marked customers arrive during the sojourn time of the 
kth customer, where it is assumed that arriving customers are marked (or 
not marked) independently and each customer has probability z of not 
being marked. 

b. Define the random variables T k and W k as the sojourn time and waiting 
time, respectively, of the fcth customer; and define the events C k ™ {the /cth 
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customer is marked}, with complement C k , and M'(X) — {no marked 
customers arrive during X), where A' is a (random) time interval. Show 
that 

P{M'{T k )) = P{M\T k ) > C k+l ) + P{M\r k ),C k+ i} 

= P{W k+l = 0,C k + i } + P{M'(W k+l ) ) C' k+l ) 
= P{W k+i =0}(l-z) + P{M'(W k+ i)}*. 

c Show that P{M r (T k )} = <f> k (X-\z) and P{M'(W k+l )) = a k+l (\- Az), 
where <f> k (s) and «#(.?) are the Laplace-Stieltjes transforms of the distribu- 
tion functions of T k and W k , respectively. 

d. Show that these results imply that 

" w - .V-**)] J,(y " 0) ' 

where i\{s) is the Laplace-Stieltjes transform of the service-time distribution 
function, W is the equilibrium waiting time of an arbitrary customer, and 
to(5) is the Laplace-Stieltjes transform of the distribution function of W. 

e. Conclude that Equation (8.38), the Pollaczek-Khintchine formula, is cor- 
rect. 

14. The M/G/l queue from the viewpoint of arrivals. Let N be the number of 
customers in the system just prior to an arbitrary arrival epoch T c , and let R 
be the time needed to complete the service of the customer (if there is one) in 
service at T c . Define 

P{N=j}=II J (7 = 0,1,...), (1) 
P{R<x,N=j}=nj(x) (x>0; y = l,2,...) (2) 

and 

*/*)-f°V" dBjix) (7 = 1,2,...). (3) 

Wishart [1961] (see also Takacs [1963]) has shown that for Xr = p<l the 
generating function u(s,z) of the Laplace-Stieltjes transform (3), 



is given by 



with 



«.,)- ^~:\ > (5 ) 

z — ij(a— Az) s — A(l — z) 



n =\-p, 



(6) 
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where i}(s) is the Laplace-Stieltjes transform of the service-time distribution 
function H(x), r is the mean service time, and \ is the customer arrival rate. 
Finally, inversion of (5) gives 

2 nj(x)zJ= a-p)^ 1 -') r e ~*-'*[H(£+x)-H{Q]dS. (7) 

j_ x ' T\(A-hZ)-Z Jo 

a. Using the normalization equation 

00 

and (5), prove Equation (6). 

b. Starting with Equation (7), show that 

£ J7,(*)=A Hi -//(£)]</£, 
j-i 

and that this in turn implies 

P{R<x\N>l} = - f X [l-H(i))di (8) 

T Jq 



Observe that (8) further justifies the use of (8.11). 
c. Show that 



JZo z-f}(X-Xz) 



Compare Equations (9) and (8.12). 
d. Let W(x) be the waiting-time distribution function for an arbitrary 
customer when service is in order of arrival. Show that (Takacs [1963]) 

w(x)=n +% rnjix-OdH^X®, (10) 

and that therefore W{x) has Laplace-Stieltjes transform co(j): 



Compare Equations (11) and (8.38). 
In parts e-j, the derivation leading to (5) is outlined. 
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e. Argue that 

n = n ( a °e- Xt dH(t)+ f r r e^ +I Un k {y)dH* k {z), (12) 

ni( x ) = n f OO [H(t+x)-H(t)]Xe- x 'dt 
Jo 

+ 2 f "* _A °' + * ) <a3 k O>)<itf* ( *~ ,) (*) 

k-l J J o 

X [^[Hit+x)-^)]^'^, (13) 

nj(x) = jf°°[J3,._ 1 (/+x)-i3;._ 1 (0]A e -^^ 

+ 2 f °° e- X( y +z Un k ( y )dH* k ~j\z) 

k -jJo Jo 

x f ^[^(r + x)-^/)]^-^^ (y=2,3,...). (14) 

f. Show that Equations (12), (13), and (14) yield, respectively, 



n =n oV (\)+ 2 MV-nW, 

k- 1 



n + 2 iMX)**- 1 ^) 



(15) 
(16) 



*/')-7=x[*-i(*)-*-ito] 



+ TZt[i(*) -*(')] 2 MVV*- J (V 0=2,3,...). (17) 
*-> 

g. Show that substitution of (16) and (17) into (4) yields 
u(s,z)[s— A(l — z)] = \zu(X,z) 



+\z[ V (X)-r,(s)] 



u(\,v(*))-u(\z) 
r,(X)-z 



o. Take s = \ — Az in (18), and show 



U »(A,z) u(\,t)(\))-u(\,z) 

7,(\-\z)- v {\) Z~ V (X) 



so that (18) becomes 



( x_ A^t/CO-t^A-A^MA,*) 
[s-(\-\z)][r,(\)-n(\-\z)) 



(18) 



(19) 



(20) 



i. Show that (15) implies 



«(A,t,(A)) = J7 [1-t7(A)], 



(21) 
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and deduce from (21) and (19) that 

u(Kz) = z-na-xz) • (22) 

j. Show that substitution of (22) into (20) yields (5). 

k. Verify by taking Laplace-Stieltjes transforms in (7) that (7) gives the 
inverse of (5). 

15. The integrodifferential equation of Takdcs. Let V t be the virtual waiting time in 
the M/G/l queue with order-of-arrival service, and define the distribution 
function V(t,x) = P{ V t <x). 

a. Show that, for h > 0, 

V(t + h,x) = (l-\h)V(t,x + h) 

+ \h( x+h H(x + h-y)d y V(t,y) + o(h) (A->0). 
■'o 

b. Show that 

BV&£ = W£x) _ xmx) + x f*„ (x _ y)dyV(t , y y (1) 

This is known as the integrodifferential equation of Takdcs (see Takacs 
[1955] and also Takacs [1962a] and Hasofer [1963]). 
c It is not unreasonable to assume that in equihbrium Equation (1) becomes 



dV(x) 
dx 



= \V(x)~\f X H(x-y)dV(y) (x>0), (2) 
•Jo 



where we have assumed that 

lun^>-0 

and 

lim V(t,x)=V(x). 



Define the Laplace-Stieltjes transform 9(s) = J^e sx dV(x), and show that 
Equation (2) transforms to 

9(3) - V(0) - * 9(s) - * r,(s)0(s). (3) 



d. Solve (3) for 0(s) and compare with (8.38). 
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The Busy Period 

A notion that will prove useful and instructive is that of the busy period, 
defined as the length of time from the instant a customer enters a 
previously empty system until the next instant at which the system is 
completely empty. 

A typical realization of a busy period is illustrated in Figure 5.2. The 
height of the graph at any time / is the duration of time from t until all 
those customers present in the system at time t are finished with service. 
Arrivals are represented by arrows pointing up, and departures by arrows 
pointing down. At each arrival epoch the graph jumps up by an amount 
equal to the service time of the customer who just arrived. The graph 
decreases linearly with time until cither another customer arrives or the 
graph decreases to zero, signifying the end of the busy period. Observe that 
the height of the graph at any time t equals the virtual waiting time V t , if 
service is in order of arrival. The realization pictured in Figure 5.2 is a 
busy period composed of six service times. 

We shall now derive an expression for the distribution function B(i) of 
the busy period for the M/G/l queue. Consider the system in which the 
server does not start to serve any customers until there are j > 1 customers 
in the system, and then serves these j customers one at a time, and all 
subsequent arrivals, until for the first time the system becomes empty. Call 
this length of time a j-busy period. Hence a y-busy period is the length of 
continuous busy time of a server that starts serving when j customers are in 
the system. 

Let B{t) be the distribution function of the busy period, and Bj(t) be the 
distribution function of the ./-busy period, where B l (i) = B{t). Following 
Takacs [1962a], we now show that the distribution function Bj(t) of the 
y-busy period is the y'-fold convolution with itself of the distribution 
function B(t) of the ordinary 1-busy period, 

Bj{t) = B*\t) = 0,1,2,...; B*°(t) = l) (8.61) 

(where the case j — is included to simplify forthcoming calculations). 

A little thought should convince the reader that the length of a y-busy 
period is independent of the order in which the waiting customers are 



Figure 5.2. 
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served. This being so, we choose to consider the following particular order 
of service. Suppose the server is about to begin service on one of j > 1 
waiting customers. When it finishes serving the first of these j customers, it 
then serves all those customers, if any, who arrived during the service time 
of the first customer, and then those customers, if any, who arrived during 
these service periods, and so on. That is, it serves first all those customers 
who arrive during the 1-busy period generated by the first of the original j 
customers. When the 1-busy period generated by the first of the original 
j customers terminates, the server starts on the next of the original j 
customers, and serves all those customers who arrive during the 1-busy 
period generated by him. The server continues in this manner, so that the 
length of time during which it is continuously busy is the sum of j 1-busy 
periods. These j 1-busy periods are clearly mutually independent and 
identically distributed. Hence Equation (8.61) is true. 

We shall now use (8.61) to obtain the Laplace-Stieltjes transform B(s), 



of the 1-busy-period distribution function B{t). Note that a 1-busy period 
will not exceed a value / if and only if the first service time lasts a length 
£<t and the j>0 customers who arrive during the first service time 
generate a /-busy period that does not exceed t— £. Thus we may write 



where H(g) is the service-time distribution function. 

Taking Laplace-Stieltjes transforms on both sides of (8.63), and noting 
that (8.61) implies that the Laplace-Stieltjes transform Bj(s) of the y-busy- 
period distribution function Bj(t) is 




(8.62) 




(8.63) 



0=o,i,...), 



(8.64) 



we obtain 




(8.65) 



Equation (8.65) reduces to 




o 



(8.66) 
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The integral on the right-hand side of (8.66) is the Laplace-Stieltjes 
transform ri(x) of the service-time distribution function, with argument 
x = s + \-\f3(s). Thus we have the functional equation 

^(5) = T J ( J + X-Ay8(5)). (8.67) 

We can obtain the mean b of the busy period by differentiating Equa- 
tion (8.67) and using the fact that - /?'(()) = £ and -tj'(0) = t, where t is 
the mean service time: 

fi'(s)-[l-\fi'(s)]lV+*-W(')) (8-68) 

and therefore 
or 

6-^. (8.69) 

[This result was derived earlier by a heuristic argument — see Equation 
(4.12) of Chapter 3.] Similarly, a second differentiation of (8.67) gives 

r(0) _jrffiL (8.70) 

0~p) 



from which one can calculate the variance of the busy-period distribution 
function. 

It is interesting to consider directly the special case of the M/D/l 
queue. Let Nj be the number of customers served during a y-busy period, 
and set 

Following Prabhu [1965], we can write the following recurrence: 



fU) = 

J n 



-Xrj 



( n= J)> 



2^e^fi% (n>j+\), 
k= i K - 



(8.71) 



where t is the (constant) service time. It is easy to show (see Exercise 16) 
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that the equations (8.71) have solution 



fU) = L 



(Atm)"- j 
(n-j)\ 



— Xth 



(»>J). 



(8.72) 



The distribution (8.72) is often referred to as the Borel- Tanner distribution 
or, when y= 1, the Borel distribution (Borel [1942], Tanner [1953, 1961]). 

It can be shown (see Prabhu [1965]) that 2*_^=1 when p< 1, and 
< 1 when p > 1 ; also E(Nj) = y(l - p) _ 1 when p < 1, and E(Nj) = oo 
when p> 1. Thus if p> 1 there is a positive probability that the busy period 
will continue indefinitely, whereas if p < 1 it will terminate eventually; 
however, the expected number of customers served during the y'-busy 
period will be finite only if p is strictly less than one. 

Note that the expression in brackets on the right-hand side of Equation 
(8.72) is the probability that during a time interval of length m (which is 
the time required to serve the n customers whose service times comprise 
this y'-busy period) exactly n—j customers arrive. Evidently, the factor j/n 
equals the probability that these n—j arrivals occur in such a way as to 
make this a busy period. 

In an exposition of the use of combinatorial methods and ballot theo- 
rems in the theory of queues and related subjects, Takacs showed that, 
remarkably, the above interpretation of the factor j/n is valid in the 
general case of the M/G/l queue (see Theorem 5, p. 102, of Takacs 
[1967]). It follows, therefore, for the M/G/l queue that if Bj(n,t) is the 
joint probability that the number of customers served during a y'-busy 
period is n and the length of the y-busy period does not exceed t, then 

^"•')=^/ 'g5 e " XSrf/f *" (a ( " >A (8 - 73) 

where //*"(£) is the n-fold convolution with itself of the service-time 
distribution function. Since 

n-j 

it follows that 



B > it) ~?Ji'(S e ~* dH " (() - (8 - 74) 

In particular, Equation (8.74) with j — 1 provides us with the inversion of 
the Laplace-Stieltjes transform f}($) defined by the functional equation 



232 



Imbedded-Markov-Chain Queueing Models 



(8.67). Thus, 

[Prior to his derivation of Equation (8.75) by combinatorial methods, 
Takacs [1962a] obtained (8.75) by inversion directly from the functional 
equation (8.67).] 

We have observed that the transform f$(s) can be inverted to obtain an 
infinite-series representation for the distribution function B{t\ as given by 
Equation (8.75). With regard to numerical inversion of the transform fl(s), 
it can be shown by using the monotonicity of the Laplace-Stieltjes trans- 
form that the iteration scheme 

Xi+M^iiis+X-XxXs)) [/ = 0,1,...; 0<jc (5)<1] (8.76) 

converges to fi(s), Xj(s)^>fi(s), for s > whenever Xt < 1. This is illustrated 
in Figure 5.3. Therefore one may calculate, for each s > 0, the value of the 
Laplace-Stieltjes transform fi(s), and, at least in theory, invert numerically. 
In practice, as we shall see, the quantity of interest may well be the 
Laplace-Stieltjes transform fi(s) itself, not the original distribution function 
B(t). 

The interesting point here is not so much the mathematics of solving 
functional equations (although this is important if one wants to get an 
answer), but the characteristic combination of mathematical and heuristic 
reasoning that leads to the functional equation (8.67) and the explicit 
solution (8.75). 
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Exercises 

16. Verify that (8.72) is the solution of (8.71). 

17. a. Let N k be the number of customers served during a Ac-busy period in the 

M/G/l queue. Give an argument, similar to that following Equation 
(8.61), to show that 

Ni+j-Ni+Nj, (1) 

where Nj and Nj are independent, 
b. Show that 

c Show that the truth of Equation (1) for the M/D/l queue implies the 
identity 

n ~ J 1 (n-ky k - J - 1 _ (i+j) 

*-,(*-')! (n-k-j)l ij {n-i~j)\ 

(i > \, n >i+j). (2) 

In particular, when i—j=l in Equation (2), we obtain the interesting 
identity 

2 1 (J)**- I ('»-*)""*" 1 -a(«-i)»"- a (»>a). P) 

Equation (3), which is difficult to derive from first principles, is a member 
of a class of formulas related to Abel's generalization of the binomial 
theorem (see, for example, pp. 18-23 of Riordan [1968]). 

18. Let Nj be the number of customers served during a y'-busy period in the 
M/M/X queue, and show that 

P{N . = „) = I ( 2 » -J : 1 \ p m ' J , ( „ >y) . 

K J > n\ n-j J(\ +P ) 2n -J 

[Hint: Use (8.73).] Observe that the distribution of Nj depends on the form of 
the service-time distribution function [as comparison with Equation (8.72) 
shows], whereas E(Nj) does not (see Exercise 17b). 

19. The "polite" customer. Let us call a customer polite if she declines to enter 
service whenever anyone else is waiting for service, and denote by W p her 
equilibrium waiting time. (Clearly, no queue is big enough for more than one 
polite customer at a time.) 
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a. Argue that 

P{W p <x\W p >0} = ljJ[l- Btf))dZ, (1) 

where B(t) is the distribution function of the busy period, and b is its 
mean. [Hint: Use Equation (7.9).] 

b. Conclude frum (1) that 

(See Takacs [1963].) 
c Show that 

£( W >0)=^ + 1!^, 

where o\ is the variance of the busy-period distribution function. 

d. Conclude that 

2r{\-p) 

where a 2 is the variance of the service-time distribution function. 

e. Show that for the M/ M/ s queue, 

E(W p \W p >0) = 1 



(1 -p) SIL 



Service in reverse order of arrival. 

a. Let T c be the arrival epoch of an arbitrary customer who arrives when the 
server is busy, and let T x be the first subsequent service completion epoch. 
Let Pj(x) be the joint probability that j customers arrive in (T t , Ti) and that 
T, - T c <x. Argue that 

•'0 J ' 

where H(t) is given by (8.41). 

b. Let W(t) be the equilibrium waiting- time distribution function, and argue 
that 

*n/) = (l-p)+pf ( t Pj{t-x)dB j (x), (1) 

7-0 ■'0 

where Bj(x) is the distribution function of the y-busy period, 
c Let co(j) be the Laplace-Stieltjes transform of W(t), and show by taking 
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transforms in (1) that 

<o(.) -(l-p)+A s+K i- m] • ( 2 ) 

where tj(j) and fi(s) are the Laplace-Stieltjes transforms of the service-time 
and busy-period distribution functions, respectively, 
d. Show that Equation (2) reduces to 

For more on this model see Riordan [1961], Wishart [1960], and, especially, 
Takacs [1963], who obtains formulas for the moments of the waiting-time 
distribution function, and gives the inverse of (3) as 

j-l J' J o 



5.9. The M/G/l Queue with Finite Waiting Room 

In this section we discuss the single-server queue with Poisson input, 
general service times, and a finite number n of waiting positions. [Hence 
there are n + 2 states E J (J ' = 0, 1 , . . . , «, n + 1), where the value of the index j 
is the number of customers in the system.] In the analysis of this important 
model we overcome a difficulty that has not appeared in any previous 
model. 

When the number of waiting positions is finite, a customer will depart 
from the system either after he has completed service or, if he arrives when 
all waiting positions are occupied, immediately upon arriving. Note that no 
customer departing after having been served can leave behind him a 
completely full system; at least one waiting position must be empty. 

We consider first those states E ,E v .,.,E tt in which a customer who has 
just completed service can leave the system. Let II? 0=0* be the 

equilibrium probability that a departing customer (after completing 
service) leaves behind j other customers. Then, following the previous 
analysis for the infinite-waiting-room model, we obtain again Equation 
(8.20), this time valid only f or y=0, — 1, which can be written 




(y = 0, 1,...,*- 1). (9.1) 



Equation (9.1) can be solved numerically by recurrence; one can succes- 
sively determine 2Tf,7TJ,..., JT* in terms of UJ. 
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If we are interested only in the distribution of the number of customers 
left behind by a customer who gets served, then IT* is determined from the 
normalization equation 

n*+n*+--- +n* n = i. (9.2) 

Observe that since this distribution {II*, 17*,,.., IT*} satisfies the same 
equations (9.1) as does that of the infinite-waiting-room model, the finite- 
waiting-room probabilities I2j*,II*,...,TT* are proportional to the corre- 
sponding infinite-waiting-room probabilities. We assume in what follows 
that the probabilities 11$, IT*, . . . , IT* are now known. 

We wish to find the distribution of the number of customers found by 
an arbitrary arrival (not just those arrivals that are served). That is, we 
wish to find the arriving customer's distribution { IZq, TI x , II n , II n+1 }. We 
have found the distribution {TT£,IT*,. ..,TT^} of the number of customers 
left in the system by a departing customer who is completing service. By 
the theorem of Section 5.3, {IT*, IT*,..., TT*} is also the distribution that 
describes the viewpoint of the arrivals, excluding those who find the system 
in state E n+1 ; that is, IT? is the conditional probability that an arbitrary 
arrival finds the system in state E p given that j<n+\. Therefore, the 
probabilities {IT,} and {Ilf} both satisfy the same equation (9.1) when 
j<n + l t and it follows that they are proportional when j<n + l: 

nj= c n; 0=0,1,...,*). (9.3) 

It remains to find a new equation that permits calculation of the probabil- 
ity TT W+ , that an arriving customer finds all n waiting positions occupied. 
The constant of proportionality c is then calculated from the normalization 
equation 

ITo + IT^-- - +H„ + H„ +1 = 1. (9.4) 

Observe that H n+1 is the proportion of customers who are not served. If 
one wished to design a system with sufficient number of waiting positions 
so that only a given proportion of the arriving customers are turned away 
(cleared), then he could determine the number n of waiting positions 
required from a plot of H n+1 versus n. 

We return to consideration of the new equation required to calculate 
IX.+1- The carried load a' (the mean number of busy servers) is 

a' = P x +>- +/> n+1 = l_ j p ; (9.5) 

therefore, for systems with Poisson input, it follows from the equality of 
the outside observer's distribution {P ,Pi,~,P n ,P n +\} and the arriving 
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customer's distribution { JT , JT 1 ,... > iT n ,iI n+1 } that a' is given by 

a'=l-JT . (9.6) 

The probability that an arbitrary customer is served is 1 — H„ + \\ therefore, 
since a' is that portion of the offered load <z( = \t = p) that is carried 
(served), we have 

a> = a{\-n n+x ). (9.7) 

(In what follows, we use this symbol a instead of p to emphasize its 
interpretation as offered load.) Combination of (9.6) and (9.7) yields the 
required equation: 

U^ a -^. (9.8) 



Substitution of (9.3) and (9.8) into (9.4) yields 

1 



c = 



(9.9) 



ns+a%% n? 

Substitution of (9.3) into (9.4) and use of (9.2) yields 

c = l-JZ, + I ; (9.10) 

also, (9.9) becomes 

e -n&- <»■"> 

Hence, we conclude from (9.3) and (9.11) that 



and, from (9.10) and (9.11), 



Thus we see that although the finiteness of the waiting room precludes 
the writing of an equation for the probability 1 from the point of view 
of a customer completing service, we can obtain the necessary equation 
(9.8) independently from consideration of the carried Joad. (For a much 
more detailed study of this model, see Chapter III.6 of Cohen [1969]; see 
also Bhat [1973] and Hokstad [1977] for variations of this model.) 
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Exercise 

21. It is required to calculate the arriving customer's equilibrium state distribution 
{IIj} for the M / G/l queue with n waiting positions. 

a. Show that {IIj} can be calculated from the following algorithm: 

1. Assume an arbitrary value for say J7J = 1, and calculate II* , . . . , U* 
from Equation (9. 1). 

2. Set </= 11$ + nf + + II* and calculate 

n* 

_ ns+(a-i)d 

n+l IIS + ad ' 

b. Suppose that the state distribution, say {IIj}, for the corresponding in- 
finite-waiting-room queue has been previously calculated. Show that 

Vj-- = 0,1,...,/!), 

n +a 2 n, 

Ilo + a^H 

c Using the results of part b, calculate the distribution {IIj} for the particular 
case of exponential service times. Compare with the corresponding proba- 
bilities {Pj} computed from the "rate up = rate down" equations for a 
birth-and-death process. 

Another approach to the analysis of the M/G/l queue with finite 
waiting room is through use of the mean busy period. Let b(n) denote the 
mean busy period in the M/G/l queue with n waiting positions. Then the 
probability 1 — P that the server is occupied is the ratio b(ri)/[\~ l + b(n)] 
of the mean busy period to the mean cycle time. But this probability also 
equals the carried load [see (9.5)]; hence 



It follows from (9.14) and (9.7) that 

T + ab(n) y 



therefore, one can calculate the loss probability JT„ +1 if one can calculate 
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the mean busy period b(n). We now give a recurrence for the calculation of 
b{n). 

We begin by writing 

Kn)-r+"j£ Pj 2 b(k)+ f Pj 2 b{k) (»-ai,...). 

(9.16) 

where pj is defined by (8.17), and any undefined sum is taken to equal zero. 
To prove (9.16) observe first that, clearly, &(0) = t. Now assume n>l. 
Observe that the busy period is composed of the service time of the first 
customer plus some additional time if there are any new arrivals during the 
first service time. Suppose that exactly j (1 <y <*n— 1) arrivals occur during 
the first service time. Then, as the second service time begins, there will be 
j—l customers waiting in the queue. Since the length of the busy period 
does not depend on the order of service of waiting customers, we can 
imagine that none of these j — 1 waiting customers will enter service until 
any and all new customers are served who enter the waiting room after the 
start of the second service time. Thus, the mean time until the next (if there 
is one) of the original j customers enters service is b(n— y' + l) (because 
only n—j+l waiting positions were available to new arrivals during this 
time). Hence, with this queue discipline, the mean time required for the 
system to become completely empty is b(n—j+\)-\ \- b(n); this ex- 
plains the second term on the right-hand side of (9,16). Finally, if j>n 
customers arrive during the first service time, then the mean time until the 
system becomes completely empty is b(l)+ + b(n); this explains the 
third term on the right-hand side of (9.16). 
Equation (9.16) can be rewritten 



b(n) = r+ 2(1- *iLpj\b(n-k) (n = 0,l,...). (9.17) 

* = 0\ 7 = / 

If we subtract the (n— l)th equation from the nth in (9.17), we get the 
following system, which appears most suitable for calculation by re- 
currence of b(l), b(2), ; 



b(n) = 



PO T («=0, 



Po l 



(,. = 2,3,...). (9 - 18 > 



Equation (9.18) thus provides an easy algorithm for the calculation of the 
loss probability given by (9.15). An attractive feature of this algo- 

rithm is that in the process of calculating the loss probability il„ +1 for the 
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M/G/X queue with n waiting positions, one obtains as a by-product the 
corresponding loss probabilities for all the systems with smaller waiting 
rooms. 

We have given two methods for the calculation of the probability of loss 
in the M/G/X queue with finite waiting room. For this model, the first 
method is probably easier to implement. However, the busy-period method 
is conceptually easier, and may be easier in practice if the number of 
waiting positions is small and the model is not standard. An example is 
provided by Exercise 22. 

Exercises 

22. A particle-counting device consists of three one-particle buffers and a re- 
corder that visits the buffers and counts the particles. Particles arrive at each 
of the buffers in three independent Poisson streams, each with rate A. Any 
particle that finds its particular buffer occupied is cleared from the system, 
that is, not counted. A buffer is held by a particle both while waiting for the 
counter and while being counted. The time required for the recorder to count 
a particle (and empty a buffer) is a constant of length unity. Show that the 
proportion p of particles not counted by this device is 



F l + 3X(e 3X +e x -^)' 

23. Let B(J,k) be the duration of the /-busy period in the A// (7/1 queue with 
j+k—l waiting positions, with E(B(j,k)) = b(j,k); and let P(j f k) be the 
probability that throughout the busy period there will always be at least one 
unoccupied waiting position. 

a. Show that 

B(j,k) = B(l,k) + BU-X t k+X), 

and thus 

B(j,k) = B(X f k) + B(l,k+ 1)+ ••• +B(IJ+ k- 1). 

b. Show that 

bU,k) = b(j,k-X) + [l-P(j,k)]b(\J + k-X) (k>X). 

[Hint; Imagine that the customer (if there is such a customer) whose 
arrival causes the waiting room to be fully occupied for the first time will 
not enter service until there are no other waiting customers.] 
c Conclude that 

***>- w'A <*>•>■ 
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d. Show that when service times are exponential, then 

P(j,k) - P{j, k, a) - — — ■ , 

\ + a + a 2 + • • +a J+k 1 

where a is the offered load. 

e. The classical ruin problem can be formulated as follows: A gambler with 
initial capital j (1 <,J< oo) dollars plays against an adversary whose initial 
capital is k (1 <k < oo) dollars. The game consists of independent trials, at 
each of which the gambler wins a dollar from his adversary with probabil- 
ity p or loses a dollar to his adversary with probability q « 1 —p. The game 
continues until the gambler's capital either is reduced to zero or is in- 
creased to j+k, that is, until one of the two players is ruined. Let P'(j,k,p) 
denote the probability of the gambler's ruin, and show that P'(j,k f p)^ 
P(J, k,a), where p = a/(l + a); that is, 

P'(j,k,p) = P(j,k,£). 

The relationship between combinatorial problems and stochastic processes, as 
in part e, has been systematically studied by Takacs [1967]. This exercise is 
adapted from a paper by Cooper and Tilt [1976], whose main theme concerns 
the result of part c. 



5.10. The M/G/X Queue with Batch Arrivals 

An interesting variant of the M/G/l queueing model occurs when one 
assumes batch arrivals. That is, each arrival epoch now corresponds to the 
arrival of a batch of customers, where the batch sizes are independent, 
identically distributed, random variables. The customers within a batch are 
served one at a time, and, as before, the service times of the customers are 
independent, identically distributed, random variables, with distribution 
function H(x). (Thus, when the batch size is always one, this model 
reduces to the ordinary M/G/l queue.) Let W be the equilibrium waiting 
time for an arbitrary (test) customer. Then W= W x + W 2 , where W x is the 
time from arrival to start of service of the test customer's batch, and W 2 is 
the remaining time until start of service of the test customer. We shall 
show that 

and 



P (iv\- ( m ~ 1 ) T , ° 2t 



(10.2) 
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where t and a 2 are, as before, the mean and variance of the service-time 
distribution function; and m and a 2 are the mean and variance of the 
distribution of the number of customers in an arbitrary batch; and 
X mr < 1 . 

This model is worthy of our attention here, not only because it is an 
interesting variant of the ordinary M/G/l model, but also because, as 
pointed out by Burke [1975], several treatments of this model that appear 
in the literature give incorrect expressions for E( FF 2 ). 

Let N be the size of an arbitrary batch, and let S N be the sum of the 
service times of the customers in an arbitrary batch. Then S N = X l 
+ • • • + X s , where X t is the service time of the ith customer in the batch, 
where E(X^ = t and V(X i ) = o 2 . Now, if we view a batch as if it were a 
single customer with service time S N , then, according to (8.39), we can 
write 

" 2[l-A£(S w )] \ E\S„)I 

According to Exercise 5, the compound sum S N has mean 

E(S N ) = mr (10.4) 

and variance 

V(S N ) = ma 2 + r 2 a 2 . (10.5) 

Substitution of (10.4) and (10.5) into (10.3) yields (10.1). 

To obtain (10.2) observe that the discussion of Section 2.1 applies; Let 
N' be the number of customers (balls) who arrive in the batch (urn) that 
contains the test customer. Then, according to Equation (1.5) of Chapter 2, 

P{N f =J} = JP{N m =j} , 00.6) 

(where the "cycle" of that discussion corresponds to the "batch" in our 
present discussion). Now, 

E{W 2 )= f E(W 2 \N'-j)P{N'-j), (10.7) 



where, clearly, 



E(W 2 \N'=j)=&Y±r. (10.8) 
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Substitution of (10.6) and (10.8) into (10.7) yields (10.2). (The error alluded 
to earlier resulted from ignoring the difference between N, the size of an 
arbitrary batch, and N\ the size of the batch to which the test customer 
belongs.) 

Exercise 

24. Consider the equilibrium M/G/l queue with batch arrivals, and assume that 
the batches are served in order of arrival. Let u(s) be the Laplace-Stieltjes 
transform of the waiting-time distribution function for an arbitrary customer, 
and show that 

. W s-Ml-*fo(*))] m[l-u( S )]' 

where ti(s) = f^e~ 3X /JH(x), and g(z) is the probability-generating function of 
the number of customers in a batch. 

5.11. Optimal Design and Control of Queues: The A'-Policy 
and the T-Policy 

So far in this text we have discussed only descriptive queueing models, that 
is, models that are completely specified with regard to method of opera- 
tion. One reason for studying descriptive models is that they permit one to 
determine the values of the model parameters that result in the most 
efficient (that is, optimal) operation. For example, if it is required to design 
an Erlang loss system that is optimal according to the criterion that, for a 
given offered load a, no more than p% of the arriving customers will be 
blocked, then one simply chooses the number s of servers to be the 
minimum value such that B(s,a)<p/\00. (Here we are making the tacit 
and reasonable assumption that the cost of operating the system is an 
increasing function of the number of servers, and our objective is to meet 
the stated blocking criterion at lowest cost.) Recently, much attention has 
been directed to the problem of prescriptive queueing models, that is, 
models in which the optimization is carried out over a class of operating 
policies, rather than over a set of parameters for a single operating policy 
that is fixed as part of the model. Of course, one might view a set of 
possible operating policies as nothing more than a set of parameters for a 
single (more general) operating policy; that is, one might claim that the 
distinction between the static optimization of descriptive models and the 
dynamic optimization of prescriptive models is merely semantic obfusca- 
tion. It turns out, however, that the concept of dynamic optimization of 
queues is a fruitful one. 

There is now available a sufficient body of research in this area so that 
one can talk about a general theory of optimal design and control (or 
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operation) of queues. This theory is an application and extension of other 
theories and concepts such as dynamic programming, sequential decision 
theory, and Markov and semi-Markov decision processes. In particular, 
the theory of Markov and semi-Markov decision processes is discussed in 
Ross [1970]. Four papers that complement each other and among them 
provide a comprehensive explication and bibliography of research into 
optimal design and control of queueing systems are Stidham [1977], 
Stidham and Prabhu [1973], Sobel [1973], and Crabill, Gross, and Mag- 
azine [1977]. 

In this section we shall illustrate these ideas by considering several 
different types of optimal control policy for the M/G/l queue. However, 
our methodology will not be in the framework of the (rather abstract) 
general theory presented in Stidham and Prabhu [1973]; instead, we shall 
exploit the particular structure of the model and rely on certain intuitive 
arguments to obtain the optimal control policies. 

We begin by considering the M/G/l queue with the following cost 
structure. The server can be in either of two states, "off and "on," and 
can be turned off or on at any time. The server can provide service only 
when it is on. Customers who arrive when the server is off wait in the 
queue. The running cost of keeping the server on is c dollars per unit time, 
whether or not any customers are being served. To save money, the server 
can be turned off when it is idle; however, a switching cost of c x dollars is 
incurred each time the server is turned back on. Also, a (linear) holding cost 
of c 2 dollars per unit time is incurred for each customer present in the 
system, whether waiting or being served. Our objective is to find the 
optimal operating policy, that is, the rules for turning the server on and off 
that result in the lowest long-run cost. 

A stationary policy is defined as one that always prescribes the same 
action whenever the system is in a given state. For obvious reasons, 
attention is usually restricted to the class of stationary policies, and the 
following three questions are asked: (1) when does an optimal stationary 
policy exist, and, if it exists, (2) what is its structure, and (3) what are the 
values of its parameters? These are difficult questions to answer in general 
(see Stidham and Prabhu [1973]), but we can often make some reasonable 
guesses for specific problems. In particular, if we assume that an optimal 
stationary policy exists, and can make a reasonable guess as to its struc- 
ture, then it may not be difficult to obtain the values of the parameters of 
the putative optimal policy. 

For the particular problem at hand, let us assume that an optimal 
stationary policy exists, and try to infer what the structure of such a policy 
might be. It seems obvious that if the server is on and there are customers 
present, there would be no advantage in turning it off, because to do so 
would incur unnecessary holding costs. Likewise, if the server is off and 
there are no customers present, there would be no advantage in turning it 
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on, because to do so would incur unnecessary running costs. Therefore, we 
conjecture that there is an optimal stationary policy and that it is one of 
the following: 

1. Turn the server off when there are no customers present, and turn it on 
when there are n customers present. Such a policy is called an N -policy. 

2. Leave the server on permanently. Such a policy is called a do-nothing 
policy. 

If there is indeed an optimal stationary policy, and if it is in fact of the 
form conjectured, then our optimization problem reduces to finding the 
value of the parameter n that produces the optimal iV-policy, and compar- 
ing this policy with the do-nothing policy. A proof that an Af-policy or a 
do-nothing policy is, in fact, the optimal stationary policy (when the 
control information available is the number of customers present at each 
point in time) is given in Heyman [1968], using the same kind of dynamic- 
programming argument that is formalized in Stidham and Prabhu [1973]. 

Of course, it makes sense to consider the iV-policy only if such a policy 
can be implemented. For example, suppose that the server of our model 
represents an electronic telephone switching machine that cannot directly 
"see" if a customer has arrived, but must periodically "look" to detennine 
whether a customer is present. If we assume that each look corresponds to 
turning the server on, then we might conjecture that there is an optimal 
deterministic stationary policy that is one of the following: 

1. Turn the server off at the end of a busy period (when there are no 
customers present), and turn it on (that is, look) after an interval of 
length t. If there are no customers present when the server looks, turn it 
off and look again after another interval of length t. Repeat this 
procedure until the server finds a customer waiting. Then, leave the 
server on until the end of the busy period. Such a policy is called a 
T -policy. 

2. Leave the server on permanently. 

Hence, optimization of this system reduces to finding the value of the 
parameter t that yields the optimal ^-policy, and comparing this policy 
with the do-nothing policy. (The T-policy, motivated by the application 
described here, has been studied by Heyman [1977].) 

It is intuitively clear (and we will show it) that, all other things being 
equal, the optimal T-policy is inferior to the optimal iV-policy (because the 
T-policy uses less information about the status of the queue than the 
iV-policy). The point to be made here is that the structure of the optimal 
stationary policy depends upon the constraints built into the original 
model. 

We shall consider each of these policies in turn. Our approach is to 
study the general case of the P-policy; specialize our results to obtain 



246 



Imbedded-Markov-Chain Queueing Models 



fonnulas for the optimal iV-policy and the optimal T-policy; and prove 
that, all other things being equal, the former is superior to the latter. 

To this end, we denote by a p the expected value of the cost per customer 
to turn the server on under policy P; and we denote by b P the expected 
value of the waiting cost per customer under policy P in excess of that 
which would be incurred under any policy according to which the server 
can never be off when a customer is waiting. Let 

c P = a p + bp. (1 1-1} 

Define the random variable M P as the number of customers served, under 
policy P, during the busy period of a particular customer (the test 
customer) whose viewpoint we will adopt. If we let Y be the number of 
times during a cycle that the server looks at the system and finds no 
customers waiting (which may be always zero, as in the iV-policy), then 
c^l + Y) is the total cost per cycle of activating the server. (We are 
assuming that the server must be turned on to look.) Therefore, Cj(l + 
Y)/ M P is the test customer's share of the cost of turning on the server (to 
implement the test customer's cycle). Now 

^i^^y^^Mi)' (,u) 



where the first equality in (11.2) follows by definition, and the second 
follows from the properties of the expected-valuc operator and the inde- 
pendence of Y and M P . To calculate E(\/M P ), which is given by 

*(±)-$\r(M r -i). (iu) 

observe that the discussion of Section 2.1 of Chapter 2 applies: Let N p be 
the number of customers (balls) who are served during an arbitrary busy 
period (urn) under policy P. Then, according to Equation (1.5) of Chapter 
2, it follows that 

w-')-^- <»<> 



Substitution of (11.4) into (11.3) yields 
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To calculate E(N P ), we write 

E(*p)= 2 E(N P \A P =j)P{A P =j}, (11.6) 
j~\ 

where the random variable A P is the number of arrivals waiting when the 
busy period begins, under policy P. It follows from Exercise 17b that 

E(N P \A P =j)=jL^. (11.7) 

Hence, it follows from substitution of (11.7) into (11-6) that 

To calculate E(A P ), observe that the vacation-time model of Exercise 12 is 
applicable: If we say that when the server is off it is on vacation, then 

where f p (z) is the probability-generating function of the number of 
customers present at the end of a vacation, and P(0) is the probability that 
no customers arrive during a vacation. It now follows from (11.5), (J J. 8), 
and (1 1.9) that (11.2) can be written 

c 1 (l-p)ri- J P(0)l 

/;o) ( 1U °) 

To calculate b p , let the random variable X p be the test customer's 
sojourn time under policy P, and let X be the sojourn time that the test 
customer would experience if the server can never be off when a customer 
is waiting. Then the expected value of the test customer's excess waiting 
cost is 

b P -E[c 2 (X P -X)]-c 2 [E(X P )-E(X)], (11.11) 

where the first equality in (11.1 1) follows from the definition of b p and the 
assumption of a constant linear holding cost c 2 , and the second equality 
follows from the properties of the expected-value operator. Now, using 
L-XW, it follows that E(X) = g'(\)/\ where g(z) is the equilibrium 
probability-generating function of the number of customers present in the 
ordinary M/G/l queue; similarly, we can write E(X P ) = g P (\)/\, where 
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g P (z) is the corresponding probability-generating function in the M/G/l 
queue operating under policy F. If we again adopt the viewpoint that the 
server's off periods correspond to vacations, then it follows from Equation 
(3) of Exercise 12 that 

f"(l) 

«M-*w m m>-- (1112 > 



therefore, 



"'-am- (1113) 



Finally, it follows from (11.10) and (11.13) that (11.1) becomes 
c,(l-p)[l-f(0)] C2 /;(l) 

' W) [l+E(Y) l + MU)- (1U4) 

Equation (1 1.14) is the key result: To find the optimal policy of type P, it 
suffices to find the value of the parameter associated with policy P that 
minimizes the right-hand side of (11.14). 

For example, consider the TV-policy. Then, clearly, E(Y)=0, P(0)—0, 
and 

f N (z) = z n , (11.15) 

and hence 

■#(!)-» (11.16) 

and 

%{\)-n{n-\). (11.17) 

Thus, for the iV-policy Equation (11.14) becomes 

, v c,(l — p) c->(n— 1) 
c N -c N {n)- iy n P} + 2V 2X ' (n-1,2,-..). (H-18) 

Let n* be the value of the parameter n that minimizes the right-hand side 
of (11.18): 

c„(n*)= min c N (n). (11.19) 
If we treat n as a continuous variable, call it jc, and denote by jc* the 
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minimizing value of x, so that 

c n( x *) = min c N (x), (11 .20) 

then x* can be obtained by the well-known method of calculus: If 
c£(x)>0, then x* is the root of the equation 

-^-c N (x) = (x>0). 

It easily follows from (1 1.18) that 



(11 - 21) 



and 



d 2 

— c N {x)>0 ( X >0). 



Hence x*, which is the value of x for which the right-hand side of (11.21) 
vanishes, is 



-V 



2A(1 -p)c, 

v — . (11.22) 



«* is one of the two integers adjacent to jc*; it is found by substituting 
these two integers into (11.18), and taking as n* the integer that gives the 
smaller value of c N . 

Thus, we have found the value n* of the parameter n that gives the 
optimal JV-policy. However, the optimal JV-policy is not necessarily better 
than the do-nothing policy, which is the policy of never turning the server 
off. To analyze this policy, we calculate the cost of running the server 
during the idle period that precedes the test customer's busy period, and 
distribute this cost equally among the customers served during the test 
customer's busy period. Let c(0) denote the expected value of this cost; 
then 



c(0) 



where X is the interarrival time and M is the number served during the 
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test customer's (ordinary 1-) busy period. Then 
and, by the same reasoning that led to (11.5), 




where N is the number of customers served during an ordinary 1-busy 
period. Now, E(N^) = 1/(1 — p), and therefore, 



c(0) = 



(11.23) 



If we set c(0) = c N (n*), we obtain c =Xc N (n*)/(l — p). We conclude that if 
c >\c N (n*)/(l — p) the optimal policy is the N-policy with parameter n*, 
while if c <\c N (n*)/(l — p) the optimal policy is never to turn the server 
off. Note that although the long-run operating cost depends, in general, on 
the service-time distribution function, the optimal policy depends only on 
the mean service time t (and the parameters c v c 2 , and A). 

For example, suppose A = |, t=1, c 1 = 12, and c 2 = 2. Then it follows 
from (1 1.22) that x* = V3 , and thus 



1 if c N {\)<c N {2\ 

2 if c N (l)>c N (2). 



We have from (11.18) that c N (\) = 6 and c N (2) = 5. Therefore, n*=2 is the 
parameter of the optimal iV-policy. If c >Xc N (n*)/(\ — p) = 5, then the 
cheapest long-term operation is obtained with the iV-policy with n* = 2; 
whereas if c <5, then the best operating policy is to leave the server on 
permanently. 



Exercise 

25. Let c be the minimum mean operating cost per unit time in equilibrium for 
the M/G/l queue operating either continuously or under an iV-policy, and 
show that 



C + C 2 



2(1 -p) 
c p + Xc N (n*) + c 2 \ p + 



2(1 



when c < 
when c > 



*c*(**) 
1-p 

1-p 
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where a 2 is the variance of the service-time distribution function. Evaluate c 
for the above example when service times are exponentially distributed and 
(a) c —4, and (b) c = 6. Repeat the calculations for the case of constant 
service times. 

Let us now turn to consideration of the T-policy. We leave it to Exercise 
26 for the reader to show that Equation (11.14) specializes to 

c T =c T (t) = ^ — + — . (1 1.24) 

Let us define t* as the value of t that minimizes the right-hand side of 
(11.24): 

c T (t*) = min cJt). (1 1.25) 

Then, exactly the same arithmetic that led from (1 1.18) to (1 1.22) will lead 
from (11.24) to 

'♦-^F- <»*> 

Inspection of (11.22) and (11.26) shows that 

x*=\t*. (11.27) 

Equation (11.27) shows that the mean number of customers present when 
the server is activated (turned on to look) in the optimal T-policy is 
precisely the value of the parameter n* that would define the optimal 
JV-policy if that value were not required to be an integer. Observe also that, 
as with the JV-policy, the optimal T-pokcy does not depend on the form of 
the service-time distribution function. 

We stated earlier that it is intuitively clear that, all other things being 
equal, the optimal 7-policy is inferior to the optimal iV-policy, because the 
r-policy uses less information about the status of the queue. To prove this, 
suppose first that x* happens to be an integer. Then c r (/*)- c N (n*) = 
Cjit*)— c N (x*) = Cj{t*)— c N (\t*% where the last equality follows from 
(11.27). Now application of (11.24) and (11.18) shows that Cj{t*) - c N (kt*) 
= c 2 /2X > 0; that is, 

c T (t*)>c N (n*). (11.28) 
Now suppose that x* is not an integer. We first show that 



c T (t*)>c N (x* + \), 



(11.29) 
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and then that 

c N (x* + l)>c N (n*y, (11.30) 

these two inequalities show that (11.28) is correct also when x* is not an 
integer, which will prove our assertion. 
To prove (1 1.29), we write 

f * ,\ C l(l - P) C 2 X * C,(l-p) C 2 X* 

where the equality follows from (11.18), and the inequality follows from 
the fact that 

1 J_ 
x*+l < x* ' 

But it follows from (11.27) and (11.24) that the right-hand side of the 
inequality (11.31) equals <v(r*); we have established (11.29). 

To prove (11.30), let us denote by n x and n 2 the integers adjacent to x* 
on the left and right, respectively. Then, n 2 <x* + 1, which implies that 

c N (n 2 )<c N (x* + l) (11.32) 

[because, according to (11.21), c N (x) is an increasing function of jc when 
x>x*]. Thus, (11.30) is true when n* = n 2 . If n* = n u then, of course, 
Cjv(»i)<Cjv(«aX and it then follows from (11.32) that c JV (n 1 )<c^(x*+l). 
Thus, (11.30) is true also when n*~n v The proof is complete. 



Exercise 

26. Consider the M/G/l queue operating under a T-policy, with parameter /. 
a. Show that 



b. Show that 

f T (z) = e-V-*»<. 
c Conclude that (11.24) is correct. 

The results x*, n*, and c N (n) for the jV-policy were first derived by 
Yadin and Naor [1963]. This model was extended and studied further by 
Heyman [1968], Other studies of this model are referenced in the three 
survey papers mentioned at the start of this section. All of the results given 
here for the T-policy were first derived by Heyman [1977], An interesting 
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and related model is the D-policy, according to which the server is turned 
off at the end of each busy period and turned back on only when the sum 
of the service times of the waiting customers exceeds some fixed value d. 
This model is more difficult to analyze, and cannot be handled by the 
method given here. The interested reader is referred to Balachandran 
[1973] and Balachandran and Tijms [1975], 

5.12. The M/G/l Queue with Service in Random Order 

In this section we consider the equilibrium M/G/l queue in which 
whenever a customer completes service the next customer to begin service 
is selected at Tandom from among all the customers waiting in the 
queue. Our objective is to derive algorithms for the numerical computa- 
tion of the complementary conditional waiting-time distribution function 
P{W>t\W>0}, which is the probability that an arbitrary customer who 
is forced to wait in the queue will wait more than t (for any />0) for 
service to commence. The unconditional waiting-time distribution function 
can then be calculated from the relation P [W>t} = P {W>t\W>0} 
xP{W>0}; for the M/G/l queue, P{W>0}=p. This model has been 
studied by LeGall [1962] and Kingman [1962], who used transform 
methods. These yield formulas for the moments of the waiting-time distri- 
bution function (see Takacs [1963]), but numerical calculation of the 
waiting-time distribution function requires the inversion (numerical or 
otherwise) of transforms. 

The algorithms given here, which are taken from Carter and Cooper 
[1972], do not use transform methods. Nevertheless, numerical calculations 
may be impractical except in certain special (but important) cases. We will 
present the results of numerical calculation for the special case M / E k /\ 
with k=\, 2, and 3. These results are in excellent agreement with those of 
Neuts [1977], who used this model to compare with his method of using 
phase-type approximations for numerical calculations. 

The special case of the M/D/l queue with service in random order has 
been studied by Burke [1959], who, by utilizing certain simplifying results 
that apply only to the case of constant service times, obtained an expres- 
sion suitable for the calculation of the waiting-time distribution function. 
Burke's paper includes a useful set of curves. Additional discussion on the 
numerical properties of the waiting-time distribution function for the 
M/D/l random-service queue is given in Lee [1966], 

Suppose that an arbitrary customer (the test customer) arrives at time T c 
and finds j > 1 other customers in the system (either in service or waiting 
for service). Then the length of time that the test customer waits for service 
to begin is the sum of 

1. the elapsed time between the test customer's arrival epoch T c and the 
first subsequent service completion epoch T x , and 
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2. the elapsed time between T x and the instant at which the test 
customer commences service. 

The analysis of the total waiting time is complicated by two facts. First, 
the time intervals (1) and (2) are not, in general, statistically independent. 
Second, as we have pointed out, any customer in service at epoch T c when 
the test customer arrives does not, in general, have the same service-time 
distribution function as does an arbitrary customer. Specifically, the rela- 
tionship of the distribution function H(x) of the length of the service 
interval containing the arrival epoch T c to the distribution function H{x) 
of the length of an arbitrary service interval is obtained by application of 
Equation (7. 1 9), namely, 

dH(x)"~xdH(x); (12.1) 

likewise, the distribution function H (x) of the length of the remainder of 
this service interval, from T c to T u is obtained by application of (7.9) [see 
also Equation (8) of Exercise 14], namely, 

H{x)=\f*[\-H{ii)]dt. (12.2) 

Note that in the special case of constant service times these difficulties 
disappear. When the service times are constant, the delay suffered by a 
blocked customer between the instant of his arrival and the first postarrival 
departure epoch is independent of the delay subsequent to this epoch, and 
the distribution of the service interval containing T c is the same as that of 
an arbitrary service interval (that is, constant). 

We are now ready to calculate for the case of general service times the 
conditional probability P{ W>t\ W>0), where W is the waiting time of 
the test customer. We denote by T c the test customer's arrival epoch, and 
by T x the first departure epoch subsequent to T c . We also define N(T) as 
the number of customers in the system (including service) at any time T, 
and W(T) as the remaining waiting time for service to commence for any 
customer who is waiting at time T. Then we may write 

P{W>t\W>0)=P{T l -T c >t\JV>0} 

+ 2 f t P{W r {T l + 0)>t-£\W>0,N(T l +0)=j} 

X 4 P { N( T t + 0) =y\ T, - T c < £ J W > } . 

(12.3) 

Let us define 



fi/{) = P { N( T x + 0) =;\ T x - T c < £| W > 0} = 1,2,...). (12.4) 
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Now consider the service interval containing the test customer's arrival 
epoch T c . The probability that this interval has length between x and 
x + dx is dH{x), given by Equation (12.1). The probability p t {x) that, in 
addition to the test customer, exactly i other customers arrive during such 
an interval of length x is, by assumption, the Poisson probability 

0=0,1,...). (12.5) 

Since the arrival process is Poisson, then for fixed x the length of the 
interval (T c , 7\) is uniformly distributed over (0, x). It follows that Q,(f) has 
density 

^e,(i)=e/«) 

given by 

QJ(0 = Jf °°( n* Pj ,_ x {x) + 2 n*Pj-M^ ^ dH{x) o - 1, 2, . . . ), 

(12.6) 

(where {H?} is the equilibrium state distribution as seen by a departure), 
which, in view of Equation (12.1), can be written 




(12.7) 

Now let us define 

W J (x) = P{W(T l +0)>x\W>0,N(T l +0)=j} = 2,3,...) 

(12.8) 

and W l (x) = Q. Then Equation (12.3) can be written 

00 /—I rt - 

p{w>t\w>Q} = \~H(t)+ 2 J —r~ / wn-OQXOdL (12.9) 

y= 2 J J 

where H(x) is given by Equation (12.2), <?_/(£) is given by Equation (12 J), 
and Wj(x) satisfies the following set of integral equations: 

Wj(x)=\-H{x)+ | ^f 1 ^ ( X Pi {iW j+i ^{x-®dH{Z) 

= 2,3,...). (12.10) 
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It remains to solve Equation (12.10) before (12.9) can be used. One 
formal approach is to assume that each function Wj(x) has a convergent 
Maclaurin series expansion, and then use (12.10) to determine the 
coefficients in the expansion. This method is outlined in Exercise 27. 
Although numerical calculation by this algorithm may appear hopelessly 
complicated, it has in fact been demonstrated for the M / E k / 1 queue. The 
results are reproduced in Table 5.1. 

Another method for obtaining the functions Wj(x), suggested by P. J. 
Burke [1967, unpublished], involves the introduction of an additional 
conditioning variable (sometimes called a supplementary variable). Let S k 
be the sum of the first k service times commencing at epoch T v and let the 
additional conditioning variable X(x) be the value of the largest integer k 
(A; = 0, 1,...) such that S k <x. We define Wj k (x) as follows: 

W jt k {x) = P { W{T X + 0) >jc| W >0, N{T X + 0) = j, X(x) = k] 

0-2,3,...; *-0,l,...) (12.11) 

and W x k (x)=0, where, as before, T, is the first service completion epoch 
after the arrival of the test customer, N(T 1 +0) is the number of customers 
in the system (including service) just after epoch T v and W(T x + 0) is the 
remaining waiting time for service to commence for a customer who is 
waiting just after time T v 



Table 5.1. Sample Calculations by the Maclaurin-Series Method for 
P { W > t\ W > 0} for the M/E k / 1 Random-Service Queue 





p = 0.3 


p = 0.8 


k = \ 


k = 2 


Jfc = 3 


Jfc-1 


k~2 


Ar — 3 


0.0 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


0.2 


0.8487 


0.8364 


0.8336 


0.9254 


0.9166 


0.9137 


0.4 


0.7237 


0.6889 


0.6766 


0.8609 


0.8395 


0.8312 


0.6 


0.6199 


0.5365 


0.5394 


0.8045 


0.7709 


0.7565 


0.8 


0.5331 


0.4599 


0.4261 


0.7547 


0.7107 


0.6908 


1.0 


0.4602 


0.3758 


0.3361 


0.7102 


0.6578 


0.6338 


1.5 


0.3236 


0.2305 


0.1894 


0.617 


0.551 


0.521 


2.0 


0.2319 


0.1455 


0.112 


0.544 


0.470 


0.440 


2.5 


0.1689 


0.0945 


0.072 


0.483 


0.407 




3.0 


0.1248 


0.063 




0.433 


0.358 




3.5 


0.093 


0.043 




0.39 


0.33 




4.0 


0.070 






0.35 






4.5 


0.054 






0.33 







"In units of mean service time. 
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We have that [see Equation (7.8)] 

P { X(x) = k}= H* k (x) - H*< k+l \x) 

(A: = 0,1,...; H*°(x) = \; x>0), (12.12) 

where H* k (x) is the £>fold convolution with itself of the service-time 
distribution function. From the theorem of total probability it follows that 

00 

Wj{x)= 2 W. k {x)P{X(x) = k] (7 = 2,3,...), (12.13) 

k = 

and hence our problem will be solved if we can determine the conditional 
probabilities Wj k (x) (J = 2,3,...; k = 0, 1, . . . ). 
Clearly 

^•,o(*)=l (7 = 2,3,...). (12.14) 

Let us define for k > 

H(i\k,x) = P { S x < i\X{x) - *} . (12. 15) 

It follows from the definition of conditional probability that 

We now have the following recurrence for k > 1 : 

= 2,3,...; A:=l,2,...). (12.17) 

The required function P{W>t\W>0} can now be determined in 
principle from Equation (12.9), complemented by Equations (12.7), (12.13), 
and (12.14) and (12.17). This method has the advantage over the 
MacLaurin-series method that it does not require any assumptions about 
existence or convergence of a Maclaurin series expansion of Wj(x), but 
whether it offers any computational advantages is problematical. For an 
outline of the application of this method to the M/D/X queue, see 
Exercise 29. 

Clearly, reduction to practice of these algorithms is difficult except in 
special cases. One may well argue that obtaining the kind of numerical 
results that engineers need from the formulae produced by mathematicians 
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is often more an art than a science. Recent progress in computational 
probability has been made by Neuts [1977] in utilizing phase-type ap- 
proximations of arbitrary distributions to obtain numerical results, and by 
Jagerman [1978] in methods for the numerical inversion of Laplace trans- 
forms. Of course, one can always resort to digital-computer simulation, 
which we shall discuss in Chapter 6. 

Exercises 

27. The Maclaurin- series method for the M/ C/l random- service queue (Carter and 
Cooper [1972]). Assume that Wj(x), defined by Equation (12.10), has the 
Maclaurin series expansion 

2 = 2,3,...; tff>-l). (1) 

Set 

Kx)=^H(x) (2) 

and 

b i {x) = h{x)p i {x). (3) 
Also, for any function f(x), define 




a. Show that 

- #(»+ 4j£=y 2* W y #}n±T k} 

i-oJ + I 1 k-i 

= 2,3,...; *-l,2,...). (4) 

[Equation (4) permits, in principle, evaluation of the sum on the right-hand 
side of (1).] 

b. Let F{t)= P{W>t\W>0}, and assume the representation 

H0= I (5) 

Show that Equation (12.9) yields the following formula for the coefficients 
{F (y) } in the expansion (5): 

+ g G<* +I )HK'- l -*> (*=1,2,...). (6) 

7 -2 J lc-0 J 



The M/G/l Queue with Service in Random Order 



259 



c Show that Equation (12.7) yields 



G^-G'W-^jy-, 0=1,2,...) (7) 



and 



(*-l,2,...; 7=1,2,...). (8) 

Thus Equations (4), (6), (7), and (8) permit calculation of the Maclaurin 
series representation (5) of the conditional waiting-time probability P { W 
>t\W>0}. Sample calculations according to this algorithm for the 
M I ' E k /\ random-service queue are given in Table 5.1. 

28. (Carter and Cooper [1972].) Suppose that 

H(x)-l-e- x ' T . (1) 

a. Show that for v = l, Equation (6) of Exercise 27 becomes 

/<!)-._ 1+ £ ^igO). (2) 

J-Z J 

b. Show that Equation (2) reduces to 

F c»„_ll^p ln i . (3) 

T p 1 — p v ' 

c Show that for v = 2, Equation (6) of Exercise 27 becomes 

F (2) = ^ + |ZZi ( 0.)^> + 2 ) ) . (4) 
T J "2 * 



d. Show that 



02) _ _ J. h^\^ Pj _ ,(0) + 2 IF/»y-i(0) J (5) 



and that this reduces to 



2 >=-^(l-pV (y = 2,3,...). (6) 

T 



e. Show that Equation (4) of Exercise 27 yields 



^-"77^1 = 2,3,...). (7) 
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f. Show that Equation (4) reduces to 

r i _ „ in 

(8) 



T 



2 -lzP ln 1 
P 1-P 



g. Compare the results (3) and (8) with the expansion (6) of Exercise 32 of 
Chapter 3. 

The additional-conditioning-variable method for the M/D/l random-service 
queue (Carter and Cooper [1972]). Suppose that 

#(*)-(? W u Cn X<T ' 0) 
v ' I 1 when x>t. 

a. Show that Equation (12.7) reduces to 

#(*)-{ r^-' when ^ (2) 
when |>t. 



b. Show that for t<r, Equation (12.9) reduces to 

P{W>t\W>Q} = \-- + - V tLLmf'w^t-Zydi 

T T . , J JQ 



(0</<t) (3) 



and, for f >t, 

^ ,-_■} J "'O 



(t</<oo). (4) 



c. Show that, since Wj(x) is a step function, the integrals on the right-hand 
side of Equations (3) and (4) can be evaluated, giving 

P{W>t\W>V)-\-± + ±- S ^J?-, 

T y-2 J 

(0<t<r) (5) 

and 

P{fK>(|fF>0}-i J^^-i{(<-[^] T )^ 

+ ( T _, + [l] T )rfr(,- T )} 

(r<f<oo), (6) 
where [x] is defined as the largest integer not exceeding x. 
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d. Show that 

*)-#(€). (7) 

e. Show that Equation (12.17) reduces to 

i — •* 

0=2,3,...; k=\,2,-..). (8) 

f. Show that Equation (12.12) reduces to 

fl when k=\-] 
P{X(x) = k} = { L T J (9) 

[ otherwise. 

g. Show that Equations (9) and (12.13) imply 

Wj(x)=W Mx/Ti (x) = 2,3,...). (10) 

h. Show that if we set k — \x/r\ in Equation (8), we obtain 

j'=0 •* 

= 2,3,...; x>r). (11) 

i. Show that Equation (11) is complemented by 

Wj{x)=\ 0=2,3,...; x<r). (12) 
j. Show that the recurrence (11) follows immediately from Equation (12.10). 

Equations (1 1) and (12) permit calculation of Equation (6). It can be shown in 
a straightforward manner that these results are equivalent to those of Burke 
[1959], who gives several useful sets of curves for the M/D/l random-service 
queue. 



5.13. Queues Served in Cyclic Order 

So far in this text we have been concerned with well-defined models. Quite 
often, however, applications require models that do not fit into the stan- 
dard molds and are not available in the literature in packaged form. Then 
the reader must develop his own model if possible. We give here an 
example of a model suggested by a practical problem that has received 
some attention in the literature, but is not one that would usually be called 
"standard." The point is that even though this model cannot be equated 
directly to a standard queueing model, it can be analyzed by the same 
techniques. 
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We shall first state the general problem, and then solve it for a particular 
special case. In so doing we hope to illustrate the underlying concepts 
while simultaneously avoiding inessential detail. The success of the method 
in the special case then suggests its own generalization. It is interesting to 
observe that in the literature, for reasons of economy and mathematical 
"elegance," such problems are usually presented in the most general form 
that the authors can handle. Such presentation often obscures the process 
by which the solution was reached, leaving the reader with only the 
answers and undue respect for the intelligence of the author. 

We consider a system of queues served in cyclic order by a single server. 
The /th queue is characterized by general service-time distribution function 
H t {x) and Poisson input with parameter \. The process begins with the 
arrival of a customer at some queue, say A , when the system is otherwise 
empty. The server starts on this customer immediately, and continues to 
serve queue A until for the first time the server becomes idle and there are 
no customers waiting in queue A. The server then looks at the next queue 
in the cyclic order, queue A + 1, and serves those customers, if any, that 
have accumulated during the serving period of queue A. The server 
continues to serve A + \ until for the first time the server becomes idle and 
there are no customers waiting in queue A + \. The process continues in 
this manner, with the queues being served in cyclic order, until for the first 
time the system becomes completely empty. The process is then reinitiated 
by the arrival of the next customer. No time is required to switch from one 
queue to the next. 

It should be pointed out that this description is quite precise, and, in 
fact, itself constitutes a large chunk of the solution. The original question is 
simply, how does one analyze a system in which queues are served in cyclic 
order? 

Systems in which a single server is shared among several queues are 
common. For example, in a time-shared computer system the users have 
access through teletypewriters to a central computer that is shared among 
them. Similarly, in some electronic telephone switching systems, the central 
control spends much of its time polling various hoppers and performing 
work requests that it finds in these hoppers. Another possible application 
of this model is in the design of a dynamically controlled traffic light at an 
intersection. 

These systems, of course, do not fit our model exactly. In particular, the 
assumptions (or lack of assumptions) made with regard to the various 
distributions, and the exact way in which the server moves among the 
queues, are the result of a tradeoff between the model's closeness of fit to 
reality and its mathematical tractability. 

We shall solve the problem posed above for the very special case of two 
queues, each characterized by the same general service-time distribution 
function H(x) and Poisson input with the same parameter A. We start by 
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writing probability state equations for the Markov chain imbedded at the 
instants at which the server finishes serving a queue. We call these time 
instants switch points. At a switch point, all the customers in the system are 
waiting in the queue other than that on which the server has just been 
working (because the server switches from a queue at the first instant the 
queue becomes empty). Since the two queues are identical, the "state" of 
the system at a switch point can be defined as the number of waiting 
customers, without regard for the identity of the queue in which they are 
waiting. 

Note that we have chosen to study this system at a particular set of 
renewal points, the switch points, which is a "smaller" set than that of the 
usual choice of renewal points, the set of all service completion points. It is 
not clear, a priori, that this choice will yield as much information as would 
a solution based on the set of all service completion points. (It turns out 
that it does.) But if one were to imbed the chain at the latter set of points, 
the definition of state would require two variables — one for the number of 
customers in each queue at a service completion point. 

Let P(k) be the probability that at an arbitrary switch point the number 
of waiting customers is k (k = 0,1,...). This event can occur through the 
following exhaustive and mutually exclusive contingencies: 

1. The server leaves a queue and finds j>\ customers waiting for 
service in the other queue, and during the time that the server spends 
at the latter queue, exactly k customers arrive at the first queue. This 
length of time is precisely a y-busy period, with distribution function 
Bj(t) = B* J (t) and corresponding Laplace-Stieltjes transform [B(s)Y. 

2. The server leaves a queue and finds no customers waiting in the other 
queue. When the next arrival occurs (at either queue) the server 
spends a 1-busy period there, and during that time k customers arrive 
at the other queue. 

These considerations lead directly to the imbedded-Markov-chain equa- 
tions for the equilibrium state probabilities: 



We now solve the equations (13.1) by the use of a probability-generating 
function g(x)\ 






(13.1) 



00 

g(x)= 2 P{k)x k . 



(13.2) 
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Substitution of (13.1) into (13.2) yields 

*(*)- 2 l 2 PU) r&£-e-»dB*>(t) 

j=l J ° k-Q KX 

which becomes, after summing over k, 

g(x)= 2 p (J) re-* l - x)t dB*i{t) + P(0) Ce-^-'^dBit). (13.3) 
j~\ J o J o 

The second integral on the right-hand side of (13.3) is simply the Laplace- 
Stieltjes transform B(s) of the busy-period distribution function, with 
argument s=\(l — x); similarly, the first integral on the right of (13.3) is 
the Laplace-Stieltjes transform with argument j = X(1 — x) of the ./'-fold 
convolution with itself of the busy-period distribution function. Since the 
transform of a ./'-fold self -convolution is the original transform raised to the 
y'th power, Equation (13.3) becomes 

*(*)- 2 P(J)[ B(\-\x)] J + P(0)B(\-\x). (13.4) 

But, with the addition of the term P(0), the sum in (13.4) is simply the 
generating function g(z) with argument z — B(X — \x). Hence, Equation 
(13.4) may be written 

g( fi(\ - \*)) - g(x) = P(0) [ 1 - fiQi - Xx) ] . (1 3.5) 

Equation (13.5) is a functional equation for the probability-generating 
function (13.2). It remains to solve Equation (13.5) and find P(Q). 
Let us now define the iteration procedure 

z v+l = B(\-\i v ) (* = 0,1,...; z -x). (13.6) 



[Notice that since z — x, the vth iterate z„ is a function of x; that is, 



Queues Served in Cyclic Order 



265 



z v = z v (x).] Successive use of (13.6) in (13.5) gives 



g(z l )-g(x) = P(0)(\-z l ), 



which, when added together, give 



V 



g(z p )-g(x) = P(0)^(\-z J ). 



(13.7) 



We shall now show that 



lim z v {x) = 1 (x < 1). 



(13.8) 



From the definition (13.6) of z„ as a Laplace-Stieltjes transform, we see 
that for any z = jc< 1, the numbers z l ,z 1> ... all lie in (0,1] and thus are 
bounded (by 1). Since g(x) is a probability-generating function, the 
sequence g{z{),g{z^) t ... is also bounded (by 1). Since Z],r 2 , ... all lie in 
(0, 1], the sum on the right-hand side of (13.7) increases monotonically with 
v. Therefore the left-hand side of (13.7), and in particular the sequence 
g(z 1 ),g(z 2 ), increases monotonically with v. Thus the sequence 
giz l ),g(z 2 ),... is both bounded and monotonically increasing, and there- 
fore has a finite limit. Hence, the left-hand side of (13.7) has a limit, which 
implies that the series of nonnegative terms on the right-hand side of (13.7) 
converges. But a necessary condition for the convergence of a series of 
nonnegative terms is that the »»th term must go to zero as v— >eo. Hence 
lim l ,_ >00 (l — z v ) = 0; that is, we have proved the assertion (13.8). 

Returning from our digression, we see that taking limits as v-*oo in 
Equation (13.7) gives, by virtue of (13.8), 



Using the normalization requirement g(l)= 1 in Equation (13.9), we have 



Setting jc = in Equation (13.10) and noting that g(0) = P(0), we obtain 



00 



*(D-*(*)-W)2(i-*,). 



(13.9) 



00 



g(*)-l-P(0) 2 [i-*/*)]. 



(13.10) 




(13.11) 
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The functional equation (13.5) has now been solved in the sense that 
g(x) can be evaluated [using (13.10) and (13.11)] for any x< 1. We can 
now use this fact to obtain some important characteristics of our queueing 
system. 

To obtain the mean number n^g'(l) of customers in the system at a 
switch point, we differentiate through the functional equation (13.5): 

-\pX\-)uc)gXP(\-)vc))-gXx) = }J>(0)PX\-\x). (13.12) 
Setting x = 1 in Equation (13.12) gives 

-\j3'(0)n-n = KP(0)/3'(0). (13.13) 

Recall that fi(s) is the Laplace-Stieltjes transform of the M/G/l busy- 
period distribution function, which has mean t/(1— Xt). Hence, — /8'(0) = 
t/(1— Xt), and Equation (13.13) yields 

" = P ^T^Xr- < 13 ' 14 > 

Note that (13.14) is meaningful only when 2Xt< 1. This is exactly what 
one would intuitively expect: A statistical-equilibrium distribution for this 
single-server system (of two queues) will exist if and only if the total load 
offered to the server is less than unity. Numerical evaluation of Equation 
(13.14) requires calculation of only one state probability, P(0), which can 
be computed to any desired degree of accuracy from Equations (13.11), 
(13.6), and (8.76). Observe that although numerical values of the Laplace- 
Stieltjes transform f$(s) are required for this calculation, inversion of the 
transform is not. (Note also that, in general, P(0)=£ 1 — 2Xt. Why?) 

Because of the symmetry in the special case we have considered here, 
the mean wait E{ W) for service to commence for a customer arriving at a 
given queue is given by Equation (8.39) with p = 2Xt. In the general case, 
however, the required formula is more complicated. It is worth mentioning 
that this general formula does not require any iteration for its numerical 
evaluation. 

Models similar to this have been studied by many authors. In particular, 
this model was used in a nontechnical article explaining queueing theory 
for the layman (Leibowitz, [1968]) as an example of an interesting and 
important queueing model. For further discussion and references, see 
Cooper and Murray [1969], Cooper [1970], Eisenberg [1972], Half in [1975], 
Kuehn [1979], and Stidham [1972]. (There is a subtle error in Cooper 
[1970]; for the record, we give the correct result in Exercise 31.) 
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Exercises 

30. The M/G/l queue with gating. An arriving customer who finds the server idle 
causes a gate to close. When this customer's service is complete, the gate 
opens and admits into a waiting room all those customers who arrived during 
this service time, and then closes. When all the customers in the waiting room 
have been served, the gate opens and admits into the waiting room all those 
customers who arrived during the collective service times of the preceding 
group of customers, after which it closes. The process continues in this 
manner. When the gate opens and finds no waiting customers, it remains open 
until it closes behind the next arrival. Show that the mean n of the number of 
customers who enter the waiting room when the gate opens is given by 



with 

Xj+, = t\(\-\xj) 0-0,1,...; x =0), 

where p is the utilization factor, A is the arrival rate, and r\{s) is the 
Laplace-Stieltjes transform of the service-time distribution function. 

31. Show that Equation (44) of Cooper [1970] is incorrect, and should be replaced 
by 

^<j)-(l-p)+_ pXi 



*"i-ib-\+VfcO)] 
x[g i -i(ri i (s),\,...,i)-g i - l (i-s\ i - l ,i,... t i)] 

(i-0,l,...,JV-l), 

where 

[Hint.Thc error is introduced in Equation (42).] 



5.14. The GI/M/s Queue 

In Section 5.8 we studied the M/G/l queue by writing equations relating 
the state probabilities to each other at successive renewal points, where, by 
definition, a renewal point is an epoch at which the future evolution of the 
system depends on the present state only, and not on the path by which 
that state was reached. We showed that in the M/G/l queue the service 
completion epochs, which are the instants at which customers leave the 
server, constitute a set of renewal points. We then wrote an equation 
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relating the state probabilities to each other at a pair of successive renewal 
points, namely the service completion points of the kth and (A; + l)th 
departing customers [Equation (8.13)]. Using a statistical-equihbrium argu- 
ment, we obtained an equation [Equation (8.20)] defining the statistical- 
equilibrium distribution {Ilf} of the number of customers left behind by 
an arbitrary departing customer, from which blocking and waiting-time 
information was subsequently obtained. 

We shall analyze the GI/M/s queue by the same technique: Identify a 
set of renewal points, relate the state probabilities at successive renewal 
points to each other, assume the existence of a limiting stationary distribu- 
tion, and solve the resulting system of equations. 

Suppose that customers arrive at epochs T V T 2 ,..., and assume that the 
interarrival times T k+l - T k (k = 0,l,...; T =0) are mutually independent, 
identically distributed, random variables, with common distribution func- 
tion G(x) = P ( T k + j — T k < x) (k = 0, 1 , . . . ), and mean interarrival time A ~ \ 
All customers wait as long as necessary for service. Let N k be the number 
of customers in the system just prior to the arrival of the kth customer; 
is the number of customers present at T k — 0. Since the input is recurrent, 
and since the service times are by assumption identically distributed 
exponential variables (with mean independent of the arrival epochs 

and each other, the arrival epochs T V T 2 ,.>. are renewal points. Hence, by 
the theorem of total probability, 

P{* k + i=J} = 2 P{^ + x=J\N k = i}P{N k = i} 

0'-0,l,...;*-l,2,...). (14.1) 

As was true with the M/G/l queue, the transition probabilities 

Pu = P{N k+i =j\N k ^i} 

depend on the indices / and j, but not on the index k. It can be shown 
using the theory of Markov chains that a unique stationary distribution 

n r lim P{N k =j] = 0,1,...) (14.2) 

exists if and only if p = X/sp.< 1. [Equation (14.2), for the GI/M/s queue, 
is the analogue of (8.19), for the M/G/l queue.] 

Therefore, after taking limits on both sides of Equation (14.1), it follows 
that the equilibrium probability state equations that determine the distribu- 
tion (14.2) are 

n J =f t Pi J n i 0=o,i,...), (14.3) 
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together with the normalization equation, 



2^-1- (14.4) 

y-o 



We now proceed to specify the transition probabilities {Py) and solve the 
equations (14.3) and (14.4) that determine the equilibrium arriving 
customer's distribution {IIj}. (Note that our analysis will yield the arriving 
customer's distribution directly. This is in contrast to the models previ- 
ously considered, in which the distribution originally obtained required 
further analysis before it could be interpreted as or translated into the 
distribution that corresponds to the viewpoint of the arriving customer.) 

To obtain the transition probabilities, note first that any arrival can find 
at most one more customer present than was found by the preceding 
arrival; hence 

PiJ = (* + W<0). (14.5) 

Now consider the case where the (& + l)th arrival finds all servers busy. 
Then the kth arrival could have found at most one server idle, and all s 
servers must have been continuously busy during the interarrival time 
T k+l — T k . Since the service times are assumed to be exponentially distrib- 
uted, with mean t = h~ 1 , it follows that 

P { * k+ 1 =JW k = l '> T k+\~ T k = x ) = (f+i'iy 

(i>s-l, J>s, /-r l-y>0); 

and, since the interarrival time T k+l — T k has distribution function G(x), 
therefore 

r 00 (suxY + 1 ~ J 

Py=f /Ti (i>s-l, j>s, i+\-J>Q). 

(14.6) 

It will turn out that for our purposes we need only the transition probabili- 
ties (14.5) and (14.6); we leave it to Exercise 34 for the reader to show that 
the remaining transition probabilities are given by 

(14.7) 
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and 

Pv ~ CfJ{j) e ~ Mx ~ yy v- e ~" x ~ y v~ J 

(i>s, j<s, i+\-j>0). (14.8) 

Let us now consider the equilibrium probability state equations (14.3) 
for the values of the index j >s; then substitution of (14.6) into (14.3) yields 

« r <x> ( S uxY + l ~ J 
i-j-i J o (i+l— y)! 

which, with a change of variable, becomes 
,=o ■'o l - 

Equation (14.9) is most easily solved by a serendipitous guess; we assume a 
solution of the simplest form that could conceivably be correct, namely, a 
geometric distribution: 

n j = Aoy J s (J>s-1). (14.10) 
Substitution of (14.10) into (14.9) yields the equation 

co= ("e-V-^dGix). (14.11) 

Observe that the right-hand side of (14.11) is the Laplace-Stieltjes trans- 
form y(z) of the interarrival-time distribution function, evaluated at z = 
(1 — u)sfi. Hence, Equation (14.11) can be written 

u = y((1-"M. (14.12) 

It can be shown (see, for example, Takacs [1962a]) that (14.12) has a 
unique root in (0, 1) whenever p=X/ sfi<\, and in this case the numerical 
solution can be found using the iteration procedure 

**+i = Y(0 -«>/*) 0=0,1,...; 0<<o <l). (14.13) 

Note that for Poisson input, the root to of Equation (14.12) equals the 
server occupancy; w = A/jju, = p. For this reason, w is sometimes referred to 
as the generalized occupancy, although the true occupancy, defined to be 
the carried load per server in erlangs, is still p = \/sji, and in general w#p. 
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We conclude that our guess (14.10) is indeed correct. It remains for us 
to calculate the constant A and the remaining state probabilities 
n s - 2 yn s _ 3 ,,..,n Q . The determination of these probabilities, described now 
by a finite number of equations, requires much labor. Since these probabil- 
ities are not required explicitly for most applications, we content ourselves 
here with the statement of a formula for the calculation of the constant A : 



where 



and 



Y, = Y(» = 0,1,...,*) (14.15) 



'-Ate) 



0=1,2,...,*). (14.16) 



A derivation following Takacs [1962a] of (14.14) and the probabilities 
I7 , Uj, . . . , II S _ 2 is outlined in Exercise 35. 

We now turn to calculation of the blocking probability and, for service 
in order of arrival, the length of time W that an arbitrary customer spends 
waiting in the queue for service to commence. An arriving customer will be 
forced to wait for service if on arrival he finds at least s other customers in 
the system, and this event has probability 

P{W>0}= f n r 
Hence, it follows from (14.10) that the probability of blocking is given by 

P{ W 7>0) = 1 A-. (14.17) 

[We leave it to Exercise 32 for the reader to verify that for Poisson input, 
the right-hand side of (14.17) reduces, of course, to the Erlang delay 
formula. This provides another illustration of the equality of the arriving 
customer's distribution {TIj} and the outside observer's distribution } 
for systems with Poisson input.] 

The calculation of the waiting-time distribution for the case of service in 
order of arrival closely parallels the corresponding calculation for the 
Erlang delay model, as done in Section 3.4. In particular, Equations 
(4.19)-(4.21) of Chapter 3 all remain valid for the GI/M/s queue. 



272 



Imbedded-Markov-Chain Queueing Models 



Corresponding to (4.22) of Chapter 3 we have 

P{Q=J\ W>Q}=n s+r AoS, (14.18) 

where the second equality in (14.18) follows from (14.10). Substitution of 
(14.18) and (14.17) into (4.21) of Chapter 3 yields the analogue of (4.23) of 
Chapter 3, namely, 

P{Q=j\W>0}=(l-a>W (y=0,l,...). (14.19) 

Since (14.19) is the same as (4.23) of Chapter 3 with p replaced by w, it 
follows that (4.24) of Chapter 3 will be valid for the GI/M/s queue when 
p is replaced by «; we conclude that 

P{ W>t\ W>0] = e-O-*^'. (14.20) 

Thus, for the equilibrium GI/M/s queue with service in order of arrival, 
the conditional (given that the customer is not served immediately) wait- 
ing-time distribution function is the negative exponential, with mean value 

E(JV\W>0)= f1 * . (14.21) 
(l — w)jju 

It follows from (14.17) and (14.20) that the (unconditional) waiting times 
are described by 

pf W>t \ = -A- e -0-*>W > (14.22) 

1—6) 



and, from (14.17) and (14.21), 



E(W)=- — . (14.23) 

( 1 — 0)) Sfl 

Of course, (14.21) and (14.23) are valid not only for the case of service in 
order of arrival, but for any nonbiased queue discipline. Calculations with 
(14.20) are facilitated by the use of the graph of Figure A.5 of the 
appendix. 

Finally, let us look at the relationship between the outside observer's 
distribution {Pj} and the arriving customer's distribution {12,}. Takacs 
[1962a] proves that 

Pj= ihj nj - 1 u=h2 '- ) - (1424) 

where \ _1 is the mean interarrival time, and iKj)—]^ when j<s and 
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KD — when j>s. If we write (14.24) as 



MIj-x-MPj = 1,2,...), 



(14.25) 



then we can make the following interpretation. The left-hand side of 
(14.25) equals the product of the arrival rate and the proportion of arrivals 
who find j—\ other customers present; hence it equals the rate at which 
the system moves from state Ej_ x to E r The right-hand side of (14.25) is 
the product of the service completion rate when there are j customers 
present (with exponential service times) and the proportion of time that 
exactly j customers are present; hence, it equals the rate at which the 
system moves from state E y to (just as we have argued previously for 
birth-and-death processes). Thus, Equation (14.25) states that "rate up = 
rate down." [For an interesting derivation of (14.25) using Little's theorem, 
see Heyman and Stidham [1980].] 

The development of this section follows that of Takacs [1962a], Chapter 
2. The reader is referred to Chapter 5 of the same book for a treatment of 
the corresponding model (renewal input, exponential service times) with no 
waiting positions. 

Exercises 

32. Verify that for Poisson input Equation (14.17) reduces to 



33. Prove that in a GI/M/s queue with service in order of arrival, the 
equilibrium conditional probability that a blocked customer will still be 
waiting in the queue when the next customer arrives is equal to the gener- 
alized occupancy w. (See Exercise 29 of Chapter 3.) 

34. Verify Equations (14.7) and (14.8). 

35. Derivation of ( 1 4. 1 4) and the probabilities JI , II X , . . . , JT, _ 2 (Takacs [ 1 962a]). 
a. Define the generating function 



P{W>0} = C(s,a). 



s — 



(i) 



and show, by using (14.3), that 





-Az s . 



(2) 
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b. Show that 



c. Define 



and show that 



IW-l-I^. 0) 



(4) 



and 

9-^ + 9-i*-^C')S£=F7 0-1,2 ,-1); 

that is, 

q.Vu^'WfVp" G-U,...,,-!), (6) 

where y, is defined by (14.15). 

d. To solve the difference equations (6), show that 

Vj^Vj-x a ( s\ sQ-yj)-J (1) 

Cj C J . l Cj(\-yj)\J) ' 

where C = 1 and Cj <J> is defined by (14.16). Add these equations for 
j - i + 1, . . . ,s - 1, and, using the fact that U s _ x — II S _ t =*Aoi ~ l , obtain 

*. A ± t-t'VVp^ (i .o,, ,-,). (8) 

Now set i-O in (8), and show that (8) and (5) together imply (14.14). 

e. To obtain the unknown probabilities JT^iT,,.. .,n s _ 2 , show that 

y-o 



and use the fact that 



do) 
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to deduce the final result, namely, 

Hj= J 2 !' V/ a-ai,...,*-2), (ii) 

where U t is defined by (8). 

5.15. The GI/ M/s Queue with Service in Random Order 

The GI/M/s random-service queue has been studied by LeGall [1962], 
Takacs [1962b], Burke [1967, unpublished], and Carter and Cooper [1972]. 
LeGall gives an expression for the characteristic function of the waiting- 
time distribution function; Takacs gives a similarly complicated expression 
for the corresponding Laplace-Stieltjes transform. Their results are quite 
formidable, and reduction to practice (through either mathematical or 
numerical inversion) does not appear simple. 

As we did in Section 5.12 for the M/G/l queue with service in random 
order, we shall here derive algorithms (the Maclaurin-series method and 
the additional-conditioning-variable method) for the numerical computa- 
tion of the complementary conditional waiting-time distribution function 
P{W>t\ W>0) for the GI/M/s random-service queue. Then the uncon- 
ditional waiting-time distribution can be found from the relation 

P{W>t} = P{W>t\W>Q}P{W>0}, (15.1) 

where P{ W>0] is given, for any nonbiased queue discipline, by (14.17). 

Let N be the number of customers present when an arbitrary customer 
(the test customer) arrives, and define 

Wj{t) = P{W>t\N=s+j} t/=0,l,...). (15.2) 

The probability that the test customer's waiting time W will exceed t can 
be written 

00 

P{W>t}= 2 P{W>t\N=i}P{N=i); 

i-0 

that is, 

P{W>t}= f W j (t)P{N=s+j} t (15.3) 

y-o 

where P{N = s+j} = P{Q=j,W>0} = Au> J , as given by (14.18). Hence, it 
follows from (15.1) and (15.3), together with (14.17) and (14.18), that 

P{W>t\JV>Q} = (\-(c) 2 <^Wj(t). (15.4) 

y-o 
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[Note that (15.4), which is valid for any nonbiased queue discipline, is the 
analogue of the result of Exercise 30 of Chapter 3. In particular, when 
service is in order of arrival, then 

"JO- 2 ^«-*. 

and (15.4) yields (14.20).] It remains to calculate Wj(t) for service in 
random order. 

We shall first give a heuristic derivation of the Maclaurin series expan- 
sion of the conditional probability P{ W>t\W>0}> and show that our 
results include Equation (6) of Exercise 32 of Chapter 3 (for the M/M/s 
random-service queue) as a special case. This method, which we call the 
Maclaurin-series method, results in an algorithm that may be computation- 
ally useful in some cases. 

We begin with the Maclaurin-series method (Carter and Cooper [1972]). 
Consider a test customer who, upon arrival at time T c , say, finds all s 
servers busy and j>0 other customers waiting for service. Then one of the 
following two events must occur: (1) the next customer arrives after time 
T c + t, or (2) the next customer arrives prior to time T c + 1. 

In case (1), the test customer will wait more than / for service to begin if 
and only if he is one of the / (1 < i <j + 1) customers still waiting for service 
at time T c + t. (If /' = 0, the test customer will necessarily have begun 
service.) Since service times are assumed to be independently exponentially 
distributed with common mean /a -1 , the probability p k (x) that exactly k 
customers complete service in an interval of length x, given that all s 
servers are continuously busy throughout this interval, is the Poisson 
probability 

pM-tel&e-** (*-(U....). (15.5) 

Now N is the number of customers present when the test customer arrives 
at T c ; let T c , be the next arrival epoch after T c . If T c .-T c >t, then the 
(conditional) probability that the test customer waits in excess of t for 
service to commence is 

J+l i 

P{W>t\N- s +j,T c ,-T c >t}= 2 7XT^ + i-/(0. 

where the factor i/(J+ 1) is the probability that the test customer is not 
among the j+l — i customers selected (according to the random selection 
procedure) to begin service during the interval (T C ,T C + 1). Since the 
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interarrival-time distribution function is G(x), and since T c is an arrival 
epoch, event (1) occurs with probability 1 — G(t). 

Now consider event (2); that is, suppose that the next customer arrives 
at time T c . <= T c 4- £, where £ < /. The probability that the test customer will 
be among the remaining / waiting customers is 

The test customer will now experience a total wait in excess of t for service 
to begin if and only if he suffers an additional delay exceeding length t— £. 
But since waiting customers are selected for service in random order, the 
probability that the test customer's additional waiting time will exceed t—£ 
(if he has not yet begun service) is the same as the probability that a new 
arrival waits in excess of t— £ for service to begin. The latter probability is 
JV^t — Q. Thus, if the next customer arrives at time T c , — T c +£, where £</, 
then the test customer's waiting time will exceed / with probability 

P {W>t\N=s+j,T c ,-T c =Z<t}= 2 —p (QjvAt-Q. (15.6) 

Finally, the probability that the next arrival epoch T c . will occur in an 
infinitesimal interval about the point T c + £ is dG(£). 

Therefore, combining events (1) and (2), we have the following re- 
currence for the conditional probability Wj(t) that the test customer waits 
in excess of t for service to begin, given that on arrival he finds all j servers 
busy and j>0 other customers waiting for service: 

Wj(t) = [l-G(t)]£ j^Pj+x-M 

+ 2 j^f'pj+i-iiWit-OdGte) (y=o,i,...), (15.7) 

where G(x) is the interarrival-time distribution function and p k (x) is given 
by Equation (15.5). 

We now assume that Wj(i) for the GI/M/s queue has the Maclaurin- 
series representation 

W-2£»? F) 0-0,1,...; Wf»-l). (15.8) 

Equation (15.8) is the same as Equation (4) of Exercise 32 of Chapter 3, 
except that (15.8) refers to the GI/M/s queue instead of the M/M/s 
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queue. In analogy with Equation (5) of that exercise for the M/M/s 
queue, we have the corresponding equation for the GI/M/s queue: 



00 jV 00 

p{w>t\w>o} = i + (i-o>) 2 ■^r'2oyw}"\ (15.9) 



,■1 " ! y=o 



It remains to determine the derivatives { Wj^) appearing on the right-hand 
side of Equation (15.9) from the basic recurrence (15.7). 
Differentiating v times on both sides of Equation (15.7), we have 

^W=^([i^(0]2^ +1 ^)) 



3" 

ikS)8KS)- 



, = 1 J f A jt=0 at 

= 0,1,...; *=1,2,...), (15.10) 
where g(g) is the interarrival-time density function, 

g{£)=~G(0. (15.11) 

For convenience we set 
and 

b J+l -,(t)-g(t)p J+l -M. (15.13) 

Now set f=0 in Equation (15.10). The integral on the right-hand side 
vanishes and we have 

»?"-<#».+ 2: TXT 'S # > .-i»?- , -* ) 

= 0,1,...; »=1,2,...). (15.14) 

The recurrence (15.14) permits evaluation of the sum on the right-hand 
side of Equation (15.9). The problem is solved if the series converges and if 
the assumed derivatives exist. If, in addition, the terms of the series are 
easy to calculate, the solution is also useful. 
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One might note the steps taken in the above development that are 
mathematically questionable. In particular, we assumed the existence of 
the Maclaurin series expansion (15.8) while ignoring possible effects on 
convergence of the values of t and «, the differentiability of FfJ(/), and the 
vanishing of the integral on the right-hand side of (15.10) at * = 0. It is 
because of these (not unreasonable) assumptions that we term this 
derivation "heuristic." The fact that the method gives agreement with an 
independently obtained result for the special case of Poisson input (see 
Exercise 36) is encouraging, but proves little. Clearly, work remains to be 
done. 

Exercise 

36. Show that for Poisson input this algorithm yields Equation (6) of Exercise 32 
of Chapter 3. 

An algorithm that does not require that the interarrival-time distribution 
function Cj(jc) possess a density function has been proposed by P. J. Burke 
[1967, unpublished]. This method, which is based on the introduction of an 
additional conditioning (or supplementary) variable, is essentially the same 
as the algorithm of Burke for the M/G/l random-service queue that we 
discussed in Section 5.12. According to Burke, this method yields results 
that appear to be well suited for computation on a digital computer. For 
the particular case of Poisson input, Burke's method yields a power series 
representation for P{W>t\W >0} that is easily shown to be convergent 
for all s(it>0. It follows from the uniqueness property of power series 
representations that the Maclaurin series (6) of Exercise 32 of Chapter 3 
exists and converges for all sfit > 0. 

We now briefly describe Burke's additional-conditioning-variable method 
for the GI/M/s random-service queue. The Maclaurin-series method was 
based on obtaining an expression for Wj(t) — P{ W >t\N— s+j) for j— 
0,1,..., where W is the waiting time of the test customer, and N is the 
number of customers present in the system just prior to the test customer's 
arrival epoch T c . Burke considers the additional random variable X(t), 
defined as the number of customers who arrive in (T c , T c + t], with t>0, 
and X(0)=0 with probability 1. Instead of calculating W^i), Burke's basic 
calculation is that of W jk {t), which we define as 

W Jtk (t) = P{W>t\N=s+j, X{t) = k) 

= 0,1,...; /c=0,l,...). (15.15) 

We have that 

P{X(t) = k} = G* k (t) - G* (k+1 \t) 

[&=0,1,...; G*°(t) = l; t>0], (15.16) 
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where G* k (t) is the k-fold convolution of the interarrival-time distribution 
function G(t) with itself. Also, from the theorem of total probability, 

00 

*}(')- 2 P{X{t) = k}W jtk {t). (15.17) 

Ar = 

Since the conditional waiting-time distribution function is determined by 
Equation (15.4), it follows from Equation (15.17) that it now remains only 
to determine the conditional probabilities W J k (t). 

First note that X(t)=0 if and only if' T c ,-T c >t. It follows from 
Equation (15.5) and the definition (15.15) of W j k (t) that 

^,o(0 = 'S 7^7/?,+ 0-0,1,...). (15.18) 

For k > 0, we define 

G{i\k 9 t) = P{T c ,-T c < £\X{t) = k), (15.19) 

which is the conditional distribution function of the elapsed time T c , — T c 
between the arrival epoch T c of the test customer and the next arrival 
epoch T c ,, given that k arrivals occur in (T c ,T c + t]. It follows from the 
definition of a conditional probability density function that 

^^ P{X %%^ dG(0 (*-U,..). (.5.20) 

Then, reasoning in a manner similar to that leading to Equation (15.7), we 
have the following recurrence f or k > 1 : 

= 0,1,...; k=l,2,...)- (15.21) 

Thus, using Equations (15.18) and (15.21) one can compute P{ W >t\ W > 
0} to any desired degree of accuracy. 

Complicated as these algorithms for the GI/M/s random-service queue 
are, they are clearly less complicated than the corresponding algorithms for 
the M/G/l random-service queue that are given in Section 5.12. This is a 
consequence of the fact that we analyzed the GI / M/s queue directly from 
the viewpoint of the arriving customer, whereas our analysis of the 
M/G/l queue required a "translation" from the viewpoint of the depar- 
ting customer to that of the arriving customer. 
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6.1. Introduction 

One can define a model as a description that eliminates nonessential detail 
and captures the essence of the system being described, and simulation as a 
methodology whereby a system is studied by observing the response of its 
model to artificially generated input. Our interest is in mathematical 
models of queueing systems, in which the model is a logical, not physical, 
representation of the system under study. Although the concept of simula- 
tion does not depend on the existence of a computer, it is the advent of the 
high-speed digital computer that has made simulation a practical tool of 
analysis. Simulation is a subject in its own right, but its application to the 
analysis of queueing models is of sufficient importance and closeness to 
queueing theory to warrant a chapter in this text. This chapter gives an 
introduction to the subject, much more in the nature of a survey than the 
other chapters of this book. Our intention is to raise the important 
questions, suggest some answers, and provide references for the reader who 
wishes to pursue the subject further. 

The preceding chapters of this book have all been concerned with the 
mathematical analysis of queueing models. These models were relatively 
simple, and presumably could provide adequate descriptions of simple real 
systems, or simple parts of complicated real systems. The construction of a 
model to describe a particular system is an art, not a science, and requires 
consideration of the tradeoff between the amount of detail about the real 
system incorporated in the model and the simplicity of the model. Roughly 
speaking, the inclusion of more detail will permit the analyst to draw more 
accurate inferences about the real system from its model if the analyst can 
solve the model, whereas the elimination of more detail will result in a 
greater likelihood that the analyst will, in fact, be able to solve the model, 
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thereby obtaining information about the model from which inference 
about the real system will be drawn. In other words, the analyst wants to 
construct a model that strikes a balance between accuracy and tractability. 
It is impossible to state in general which information about a real system 
should be included in its model and which should not. Most readers would 
probably agree, for example, that a model whose purpose is to predict the 
proportion of patients who will be denied immediate service in a hospital 
emergency room should not contain information on the positions of the 
stars (although the arrival rate may well depend on the phase of the moon). 
Less obvious to the naive observer is the assertion that the type of 
treatment required is relevant only through the distribution of the lengths 
of the treatment times. A more sophisticated observer would know that if 
the arrivals can be said to follow a Poisson process, and if the blocked 
patients do not wait but instead overflow to another facility, then only the 
mean of this distribution is relevant to the system's performance. Clearly, 
the type of model chosen to describe any given real system will depend on 
the knowledge, insight, and wisdom of the analyst. 

Digital-computer simulation permits, in principle, the analysis of arbi- 
trarily complicated models of real systems. Conceptually, it is no more 
difficult to simulate a complicated model than a simple one. This fact 
reflects both the strength and the weakness of the methodology. For, on 
the one hand, simulation provides a method of analysis that is applicable 
when mathematical analysis is too hard, and it allows the analyst to model 
a real system in a way that is more complicated (and therefore, presum- 
ably, more accurate) than mathematical analysis would permit. On the 
other hand, all other things being equal, simulation does not yield any- 
where near the insight and information that a mathematical model will 
yield if it can be solved. Because of the generality of simulation methodol- 
ogy (analysis of queueing models is only one of many kinds of applica- 
tions), and because of the difficulty of obtaining the specialized skills 
required for the construction and solution of mathematical models, there is 
a tendency for applied workers automatically to adopt simulation rather 
than mathematical analysis, regardless of the merits of the different ap- 
proaches with respect to the particular problem at hand. Of course, the 
optimal situation is when the analyst has competence in both simulation 
and mathematics, and can use either one or a combination of the two. 

Historically, the earliest use of the concept of simulation is usually 
credited to Count de Buff on, a French naturalist, who in 1777 stated what 
is now known as Buffon's needle problem: A needle of length a is thrown at 
random on a plane covered with parallel lines that are all a distance d 
apart (a<d). What is the probability p that the needle will intersect one of 
the lines? The answer is p=2a/ird. Buff on is supposed to have performed 
the experiment and, by setting the observed frequency equal to the 
probability p, to have calculated the value of it. This remarkable idea> the 
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calculation of a deterministic quantity from the data of a random experi- 
ment, became a practical methodology with the advent of digital com- 
puters to carry out logical operations with artificially generated data at 
high speed. This technique was first used during the development of the 
atomic bomb, for the numerical evaluation of integrals, where it was called 
the Monte Carlo method. It is not widely known, however, and therefore is 
worth mentioning here, that the first application of the concept of simula- 
tion for the analysis of systems subject to stochastic demands, such as we 
are discussing here, was done much earlier. In 1907 J. C. T. Baldwin 
described, in an engineering report for the American Telephone and 
Telegraph Company, what is probably the first modern simulation study. 
Baldwin's study, entitled "Report of Engineering Studies for Semi- 
Mechanical Switchboards," used artificial data to simulate telephone 
traffic and determine the loads that could be handled by operators 
providing service with a given average delay. Baldwin called his technique 
the " "Throw Down' Method." Although it did not, of course, use a 
computer, it embodied the essential concepts of a modern simulation 
study. The throwdown became a standard tool of teletraffic engineers. With 
the advent of computers, the concept of the throwdown became widely 
applicable, and the term simulation has come to be the accepted one to 
describe this methodology. 

Our objective in this chapter is to provide an overview of the use of 
simulation to construct and analyze models of real queueing systems. 
Hence, our discussion will be restricted to discrete-event digital simulation, 
which is concerned with models of systems whose states make only discrete 
changes as time passes. We shall try to illustrate the relative strengths and 
weaknesses of simulation as opposed to mathematical analysis of queueing 
models. We assume that the reader is familiar with the basic concepts of 
computers and computer programming. 

A simulation model is a description of the structure of the system under 
study. When the simulation program is run, the computer generates arrival 
times, service times, and any other stochastic variables that correspond to 
the demand on the system. In other words, each simulation run is an 
experiment in which artificial, computer-generated input data are fed 
through a logical structure whose response to the input data "simulates" 
the response that the real system would make to the same input data. 
Hence, the computer must have the capability of generating input data 
with any prescribed statistical characteristics, which are supposed to mimic 
the input to the real system. That is, digital-computer simulation requires 
algorithms for generating realizations of customer arrival processes and 
service times, such as Poisson input and exponential service times. 

Of course, before a simulation run can be made, the simulation program 
must be written. The details of the programming depend on the type of 
computer, its operating system, and the programming language used. In 
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particular, the analyst may have a choice between a general-purpose 
programming language, such as Fortran, and a simulation prograrnming 
language, such as gpss, simscript, or gasp. 

Since a simulation run is an experiment in which the response (output) to 
(artificial) data (input) is observed, this output will itself be statistical in 
nature, and will therefore require some statistical analysis for its interpreta- 
tion. This is the most critical part in the design and implementation of a 
simulation model, because it is often difficult to draw general conclusions 
and gain real insight (as opposed to merely noting what happened in a 
particular case) from output data. 

We shall discuss each of these issues in turn, emphasizing those points 
most relevant to simulation of queueing models, and referring the reader to 
other sources for further information on simulation in general. 

6.2. Generation of Stochastic Variables 

Suppose we wish to generate independent realizations of a random vari- 
able X whose distribution function is F x (x); that is, we seek a computer 
algorithm to select independent samples from a given distribution. As we 
shall see, it is sufficient to be able to generate values of a random variable 
U that is uniformly distributed on (0, 1). 

First, observe that F x (x) is a nondecreasing function of x, on a finite or 
infinite interval. For < u < 1 the "inverse" function F x l (u) is defined as 
follows: If F x (x) has a discontinuity at x = x Q and F x (x — )<u <F x (x Q ) t 
then define F x l (u) = x Q ; otherwise, let F x \u) be the realizable value of x 
for which F x (x) = u. Now define X according to the transformation 

X = F X \U), (2.1) 

where U is a random variable uniformly distributed on (0, 1). We now 
show that if the realization of X is generated by first realizing the random 
variable U and then calculating the value of X from (2.1), it will follow 
that for all x, 

P{X<x}=F x (x); (2.2) 

that is, the values of X calculated according to this inverse-transformation 
method will be samples from a population whose distribution function is 
F x (x). To prove (2.2), note that 

P{X<x} = F{F- l (U)<x}=P{U<.F x (x)}, (2.3) 

where the first equality in (2.3) follows from (2.1). Now recall from 
Equation (5.9) of Chapter 2 that if U is uniform on (0, 1), then 



P{U <u} = F u {u) = u (0<u<l). 



(2.4) 



Generation of Stochastic Variables 



285 



It follows from (2.4) that the right-hand side of (2.3) equals F x (x); hence 
(2.2) has been proved. 

For example, suppose it is required to generate a value of a random 
variable X that is exponentially distributed with mean value t. Then 

F x (x)=l-e- x/T (0<x<oo), (2.5) 

and therefore we take 

U=F x (X)=\-e- x ^; 

solving this for X, as in (2.1), we get 

X= -j\n(l- U). (2.6) 

We leave it to Exercise 1 for the reader to show that if U is uniform on 
(0,1), then so is 1— U. Therefore, we can replace (2.6) by the simpler 
calculation 

X=-rhiU. (2.7) 

Hence, we conclude from (2.7) that to obtain a realization x of a random 
variable X that is exponentially distributed with mean t as in (2.5), it 
suffices to obtain a realization u of a random variable U that is uniform on 
(0, 1), and then calculate x from the formula 

x=— t\&u. (2.8) 

In the simulation of queueing models, exponentially distributed random 
variables play a central role. In particular, a Poisson arrival stream can be 
generated by generating the (independent, identical) exponentially distrib- 
uted interarrival times. Similarly, various other distributions can be pro- 
duced by the method of phases (see Section 4.8), according to which the 
random variable in question is assumed to be composed of a sum of 
independent exponential phases. (An example of this procedure will be 
discussed in Section 6.5.) 

Exercises 

1. Show that if U is uniform on (0, 1), then so is 1 — U. 

2. Let X have the Erlangian distribution function of order n, 
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If t/|, U 2 ,..., U„ are n independent samples, each uniformly distributed on 
(0, 1), show that 

-(1/X)ln( £/,-••£/„) 
has the same distribution as X. 

It is convenient to consider separately the special case when X is a 
discrete random variable, with probability distribution P{X = x j }=p i 
(/= l,2,...,n). To generate a sample from this distribution, again simply 
generate a value u of a random variable U that is uniform on (0, 1), and 
take X = x ( if 

'S Pj<u< 2 Pj (/= 1,2,. ..,«), (2.9) 

where the empty sum is taken to be zero. For example, suppose we wish to 
determine whether the next arrival occurs before or after the completion of 
service of a customer presently being served. If the customers arrive 
according to a Poisson process with rate X, and the service times are 
exponentially distributed with mean p~ x , then the probability that the 
arrival occurs before the service completion is A/(A + jtt). Therefore, we 
say that the arrival occurs first if the realization u of the uniform random 
variable U satisfies 0<w<A/(A + /a), and the service completion occurs 
first if \/(X + n)<u<\. 

We have shown that if we can generate values of a random variable U 
that is uniform on (0, 1), then we can calculate from these values another 
set of values that represent samples from any distribution we want. 
Therefore, we need concern ourselves only with the question of how to 
generate the values of U. Fortunately, there are in existence easily accessi- 
ble subroutines that generate values that are uniformly distributed on 
(0, 1). These subroutines, which are called random-number generators, are 
generally included in scientific subroutine packages and simulation pro- 
gramming languages. Thus, they ought to be tightly written and well 
tested. (However, reliance on this supposition can be costly.) It is interest- 
ing to note that most random-number generators use algorithms that 
produce a deterministic string of numbers with the property that they 
appear to be random, in that they can pass statistical tests for randomness 
and independence. Because these "random" numbers are in fact a de- 
terministic sequence, they are often referred to as pseudorandom numbers. 

Although the inverse-transformation method is, in principle, always 
applicable, it is not necessarily the most practical way to generate sample 
values from a particular given distribution. For more information on 
random-number generation and its use in the inverse-transformation 
method, and on other methods of stochastic-variable generation, see 
Cheng and Feast [1980], Fishman [1973, 1978a], Knuth [1969], Lewis 
[1975], Lewis and Shedler [1978], Schmeiser [1977], and Yakowitz [1977]. 
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6,3. Simulation Programming Languages 

Simulation programming languages are high-level problem-oriented lan- 
guages that have been designed to facilitate the writing of simulation 
programs. They contain language statements and subroutines for certain 
procedures that are common to simulation programs, like random-variable 
generation and data collection, that are not as readily available in general- 
purpose languages such as Fortran, pascal, and pl/i. General-purpose 
languages are more flexible and can produce programs that run more 
efficiently than those written in simulation languages. On the other hand, it 
is much simpler to write a simulation in a simulation language. The choice 
of the type of programming language, and the particular language within 
that type, will depend on several factors, such as the availability and the 
programmer's knowledge of the various languages, as well as the character- 
istics of the model to be simulated. 

The most widely used simulation language is probably gpss (General 
Purpose Simulation System), a language that is particularly well suited for 
queueing problems, simscript is a widely available simulation language 
that is particularly comprehensive, gpss compares favorably with 
simscript in that it is simpler; on the other hand, it is less comprehensive. 
Neither is uniformly better than the other, gasp (General Activity Simula- 
tion Program), which is a package of Fortran subroutines specially 
written for simulation, has the advantage that it provides some of the 
benefits of a simulation language without requiring a programmer who 
knows Fortran to learn a completely new language. (Also, gasp iv has the 
capability to handle models whose states change continuously, not just 
discretely, in time.) 

The subject of simulation languages is a substantial one, far removed 
from queueing theory; the best we can do here is alert the reader to the 
existence of this body of knowledge and identify some of the major 
references. Fishman [1973, 1978a] discusses simulation programming lan- 
guages in general and gives detailed descriptions of gpss, simscript, and 
Simula, gpss is described in Schriber [1974], in Bobillier, Kahan, and 
Probst [1976], and, by its originator, in Gordon [1975], and again, along 
with simscript, in Gordon [1978]. simscript is described by its developers 
in Kiviat, Villanueva, and Markowitz [1973]. gasp iv is described by its 
originator in Pritsker [1974], and Pritsker and Pegden [1979] describe slam, 
which extends gasp iv to include queueing network models. Mention 
should be made of resq, which is a software package, consisting of an 
analytic part and a simulation part, that is designed specifically for the 
analysis of queueing networks of the type that arises in modeling for 
computer performance evaluation, resq is described in Sauer and 
MacNair [1977, 1978] and Reiser and Sauer [1978], but the program itself 
is presently available only within the IBM company. A good survey paper 
on simulation with some discussion of programming languages is Shannon 
[1975]. 
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6.4. Statistical Questions 

A simulation run is a statistical experiment in which the response of a 
logical model to artificial input data is observed. It differs from an 
ordinary statistical experiment, such as observing the response of a real 
system to real input, in that the simulation experimenter has more control 
over the input data and the model. That is, in a simulation experiment, the 
analyst can generate the input to have any specified statistical characteris- 
tics; likewise, the model can (and should) be reduced to its essential 
structure, without the extraneous factors that complicate understanding 
but have no real effect. The first problem, of course, is for the analyst to 
constuct a model that is optimally "clean" and yet captures the essence of 
the real system under study. Assuming now that the appropriate model has 
been constructed, the problem is to draw inference from the simulation 
output about the response of the model to the input data. The design and 
interpretation of any statistical experiment is a challenging task. In the 
case of a simulation experiment, the fact that the analyst has a cleaner 
model and greater control over the input would seem to promise that there 
should be a corresponding increase in the quality of the inference that can 
be drawn from the output data. One would expect, therefore, that simula- 
tion experiments will be more efficient and informative than real experi- 
ments with respect to drawing inference (about the model) from statistical 
analysis of the output data. It is probably fair to say that the general 
attempt to realize the potential of highly efficient and informative simula- 
tion output promised by the high degree of control over the simulation 
model and input has not yet been realized. This is an area of active 
research, but one that has not yet been especially productive in terms of 
useful procedures for practitioners. In this section we shall discuss some 
of these statistical questions (but, unfortunately, not much in the way of 
answers). 

In order to illuminate the kinds of statistical questions that arise, let us 
consider a structurally simple model about which we already know, 
through mathematical analysis, (almost) everything, namely, the Erlang 
loss model. That is, for the sake of example, we consider the model 
characterized by Poisson arrivals, s servers, and blocked customers cleared. 
It is a simple matter to write a program, even in Fortran, to generate 
Poisson arrivals with any given rate and service times from any given 
distribution, and to keep records of the number of customers blocked and 
the proportion of simulated time that all servers are busy. 

Since we must make an assumption about the distribution of service 
times, let us make the simplest assumption, namely, constant service times; 
and suppose that we are interested in ascertaining only the equilibrium 
load-loss relationship, that is, the fraction of arriving customers who find 
all servers busy, as a function of the offered load, after statistical 
equilibrium is attained. 
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The first question is obvious: How long will it take for statistical 
equilibrium to be reached? In theory, of course, statistical equilibrium will 
never be "reached," only approached asymptotically (unless the system is 
in equilibrium at time zero, which is impossible in the context of a 
simulation that must begin in some specified state, such as all servers idle). 
Unfortunately, little more can be said without the help of mathematical 
analysis (whose availability would probably preclude the necessity for the 
simulation). The practitioner's solution is simple: Ignore the first 100 (or 
1,000 or 10,000) arrivals in the final calculations. 

Now suppose, for the sake of argument, that we have determined how 
many arrivals are necessary to bring the simulation model to equilibrium. 
How many additional arrivals should be simulated to ensure that the 
observed fraction of blocked customers will be close to the "true" probabil- 
ity of blocking? In other words, how large should the sample size be? And 
is it better to make one long run or several short runs? 

Let's suppose that we have answered these questions (as they must be 
answered, whether rationally or not, in every actual simulation). What 
have we got? If we call the service time unity, and generate the arrivals 
according to a Poisson process with rate X [using (2.7) with t = A _1 to 
generate the interarrival times, for example], can we say that the observed 
fraction of blocked customers is an estimate of B(s, A)? Or is it an estimate 
of B(s,X), where X is the realized value of the arrival rate, that is, the ratio 
of the total number of arrivals generated to the total duration of simulated 
time? (Although X and X should be close, it is unlikely that they will be 
exactly equal.) 

Thus, we see that the numerical value generated by the simulation 
confounds several types of error, including (1) error caused by waiting only 
a finite length of time before declaring the system to be in statistical 
equilibrium (because no real system can ever attain statistical equilibrium), 
(2) sampling error caused by generating only a finite number of arrivals 
after equilibrium has allegedly been attained, and (3) error caused by any 
departure from randomness of the pseudorandom-number generator. If we 
adopt the pragmatic point of view that we have run our simulation long 
enough and have a pseudorandom-number generator that is good enough 
so that these errors are all negligible, then we will have obtained a single 
point on the load-loss curve. It is an easy matter, but perhaps quite 
expensive, to make enough runs to obtain enough points to construct the 
load-loss curve — for one particular value of s. To obtain a set of curves, 
such as those given in the appendix, would require repetition of this 
process for each value of s. (Actually, in the present example of the Erlang 
loss system, one could alleviate this problem somewhat by recognizing that 
each realization for s servers contains another realization for s — 1 servers. 
However, the results for each value of s would depend on the same stream 
of pseudorandom numbers, so that any error in this regard would have 
multiple effects.) 
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But suppose now that instead of wanting to obtain the load-loss curves 
for the case of constant service times only, we also wanted to consider 
exponential service times. One could then use the same simulation program 
as before, except that now the service times would be generated according 
to (2.7) instead of assigning each service time the same (constant) value. 
This would increase the variability of the response and thus, it seems 
reasonable to suppose, would require a larger sample size to produce 
results of equal accuracy. But, no matter how great the care and expense of 
the simulation, it is highly unlikely that the load-loss curves for exponential 
service times would be exactly the same as those for constant service times. 
Simulation could never tell us with absolute certainty that, in fact, the two 
sets of curves should be exactly the same and moreover, that the load-loss 
curves should be identical no matter what kind of service-time distribution 
we have. 

In a similar vein, a simulation analysis would almost certainly give 
different values for the proportion of customers who are blocked and the 
proportion of time all servers are busy, even though these two probabilities 
are equal. These examples illustrate the crudity of simulation analysis as 
compared to mathematical analysis; simulation should be used only when 
mathematical analysis is not feasible. 

One promising technique for increasing the efficiency of simulation is 
called the regenerative method. The idea is to identify an infinitely recur- 
ring state with the property that the future evolution of the system from 
this state is independent of the system's past history. The evolution of the 
system can then be viewed as consisting of a sequence of statistically 
independent and identical cycles, each cycle bounded by a return to this 
regenerative state. This method, when applicable, provides insight and 
some of the answers to the questions raised above. The power of the 
method lies in the fact that it creates a sequence of independent, identi- 
cally distributed random variables from correlated random variables, and 
thereby permits application of the central limit theorem (see any textbook 
in statistics) and its variants. 

According to the simplest version of the central limit theorem, if R v R 2 , 
... are independent, identically distributed random variables, with_com- 
mon mean E(R) and variance V(R), both finite, and if R(n) = 
(/?! H + R n )/n is the average value of a sample of size w, then 




(4.1) 



as n-*oo, where $>(/) is the standard normal distribution function, 




(4.2) 
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The standard normal distribution function &(t) is widely tabulated, and 
therefore, if the sample size n is sufficiently large so that the arrow in (4.1) 
can be replaced by an equals sign, useful information can be obtained 
about the relationship between the size of a sample and the accuracy of 
some inference made from the sample values. 

We shall illustrate the use of the regenerative method and the applica- 
tion of the central limit theorem by considering a simulation analysis of 
the simple single-server Erlang loss model. Our objective is to estimate^the 
probability JT that an arriving customer will find the server busy. Let TI{n) 
be the observed ratio of the number of arrivals who are blocked to the 
total number of arrivals in n cycles. Clearly, the state in which there are no 
customers present is a regenerative state. Let M i be the observed number 
of customers blocked during the ith cycle; that is, M t is the number of 
customers who arrive during the ith service time. Then, by definition, 

that is, 

n(«)— (4.4) 

\+M(n) 

where M(n) = (M l H + M„)/ n is the average (over n cycles) number of 

customers blocked per cycle. If we denote the distribution function of TUri) 

Fft n) {t) = P{n{n)<t), (4.5) 
then it follows from (4.4) and (4.5) that 

^(^pj^X^j. (4.6) 

Now, according to the central limit theorem, the random variable 

M{n)~E{M) 
y/V{M)/n 

has the standard normal distribution when n is sufficiently large. It follows 
from Exercise 21 of Chapter 2 that 



E{M)=\t 



(4.7) 
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and 



V(M) = \t+\ 2 o 2 , 



(4.8) 



where A is the rate of the Poisson input process, and r and a 2 are the mean 
and variance, respectively, of the service-time distribution function. Hence, 
we can write 



M(n)-a 



<x 



+\V) 



(4.9) 



where a = \r is the offered load; that is, 



P^M(n)<a + x^(fl+AV) j = $(*). (4. 10) 



Finally, comparison of (4.6) and (4.10) yields 



1-/ 



— a 



+XV) 



(4.11) 



Equation (4. 1 1) gives some interesting information about how the sam- 
ple size n or the service-time variance a 1 affects the accuracy of the 
inferences that can be drawn from the simulation output. For example, 
suppose we wish to compare the sample sizes required to estimate IT by 
JJ(«) with the same degree of accuracy in the cases of two different 
service-time distribution functions, with variances a 2 and o 2 , say. Accord- 
ing to (4.11), the statistical properties of the estimator IT(/i) will be 
the same in both cases if the argument of the standard normal distribution 
function on the right-hand side of (4.11), is the same for all t. In that 
case, it follows that 




(4.12) 



where «, and n 2 are the corresponding sample sizes. In particular, if we 
compare constant service times (o 2 *=Q) with exponential service times 
(a| = r 2 ), then (4.12) implies that 



« 2 = (1 + a)n l . 



(4.13) 
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According to (4.13), if a = 1 erlang, for example, then it requires exactly 
twice as many cycles to simulate the exponential-service-time model as it 
does to simulate the constant-service-time model with the same degree of 
accuracy (even though the true value of H is the same in both cases). 

If the analyst knew that the value of U is the same for every service-time 
distribution function, but did not know how to find II except through 
simulation, then he could conclude from (4.12) that he should restrict his 
attention to the case of constant service times, because it results in the 
minimum variance of the estimator II(n). (This idea, choosing an estimator 
that has a smaller variance than another estimator for the same parameter, 
is called a variance-reduction technique.) 

We can use (4.11) to investigate how accurately the statistic II(n) 
estimates the true probability II. In particular, if we consider the interval 
of half-length 8 centered at II, then the probability that the estimator TI(ri) 
lies within this interval can be calculated from (4.11): 



p{n-8<n(n)<n+8} 



n+8 



\-(n+8) 



— a 



Vi< a+AV 



n-s 



i-(n~8) 



(4.14) 



In our example we know that 



U= B(l,a) = 



a 



1 + ' 



(4.15) 



it is instructive to use (4.14) and (4.15) to investigate numerically how the 
sample size n and the service-time variance a 2 affect the accuracy of the 
estimator TI(n). (See, for example, Exercise 4.) 

^ Another interesting question is whether it is better to estimate IT by 
il( n) or P(n), where the latter is defined as the observed proportion of 
simulated time that the server is busy: 



X l + 



+ x_ 



+ y„+x* + 



(4.16) 



where X t and Y, are the observed values of the service time and the idle 
time, respectively, of the ith cycle. If we define the average values X(n) 
~(X X + ■•• +X n )/n and Y(n) = (Y l + • • • + Y n )/n y then (4.16) becomes 



P(n)=— . 

Y(n) + X(n) 



(4.17) 
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We leave it to Exercise 3 for the reader to show that the statistic P(ri) has 
distribution function 



1-/ 



VPS 



2_2 



(4.18) 



Equation (4.18), which is the analogue of (4.11), leads to the analogue of 
(4.14): 



P{II-S<P(n)<n+8} 



= 0> 



n+s 



\-(n+8) 



— a 



II- 8 



\-(n-8) 



— a 



(4.19) 



Numerical investigation of (4.14) and (4.19) shows that neither of the 
statistics IL(n) and P(n) is uniformly superior to the other. For example 
(see Exercise 4), when n = 100 cycles and the service times are constant, it 
turns out that 

P { n- 0.05 < JT( 100) < JT+ 0.05 } = P { II- 0.05 < P( 1 00) < JT+ 0.05 } = 95% 
when a = 1; when a < 1, then 

P{n-0.05<JT(100)<JT+0.05}< J P(JT-O.O5<i s (lOO)<JT+O.O5}; 
and when a > 1, then 

p{n- o.05 < jt(ioo) < n+ 0.05} >p {n-0.05 <p(ioo) < n+0.05}. 

In other words, when a = 1, then the estimators JT(100) and P(100) are 
equally likely (about 95%) to lie within 8 = 0.05 of the true value of II; 
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whereas JT is more accurately estimated by P(\00) than by JT( 100) when 
a< 1, and vice versa when «> 1 (at least, when a 2 = and 5 = 0.05). 

The statistical problems associated with the simulation of queueing 
models are quite similar to those associated with traffic measurement and 
performance measurement of real systems. For example, suppose that it is 
required to determine by measurement the loss probability of a real 
system. Is it better to estimate this loss probability by measuring the 
proportion of arrivals that are lost or the proportion of time that all servers 
are busy? If s = 1 and we are willing to agree that the arrivals follow a 
Poisson process and the service times are constant, then the above results 
imply that we should prefer JT to P if it appears that a > 1, and vice versa if 
it appears that a< 1. Here, even though we may know that (4.15) describes 
this system, we presumably don't know the value of the offered load a; 
and rather than measure a and then calculate II from (4.15), we measure 
JT directly. This question of whether JT or P is a better estimator of JT is 
discussed from the viewpoint of traffic measurement by Descloux [1965] 
for the case where the system being measured can be described by an 
j-server Erlang loss model with exponential service times. It is interesting 
that Descloux used simulation to^ verify an approximate formula he derived 
for the variance of the statistic JT. Some related papers on traffic measure- 
ment that might be of interest from the viewpoint of simulation are Benes 
[1961], Descloux [1973, 1975, 1976], Kuczura and Neal [1972], and Neal 
and Kuczura [1973]. 

A more conventional use of the regenerative method is in assessing the 
accuracy of a parameter's point estimate obtained from simulation output. 
Suppose, for example, that a simulation has been written that calculates a 
statistic whose value will be taken as an estimate of the mean waiting time 
in a certain queueing model. The analyst would like to know how close this 
point estimate is to the "true" mean value E( W) it is supposed to estimate. 
One way is to construct a confidence interval, which is an interval whose 
endpoints are calculated from the simulation output, and which will 
contain the true value of the parameter being estimated [in this example, 
E(W)\ with a prespecified probability a. 

Let N k be the number of customers who are served during the kih cycle, 
and let S k be the sum of the waiting times_of those customers, and define 
R k — S k — N k E(W). Let N(n),S(n), and R(n) be the average values (over 
n cycles) of these random variables. Then R(n) = S(n) — N(ri)E(W), and 
according to (4.1), 



S(n)-N (n)E( W) - E(S k - N k E{ W)) 



<t 



->«>(/) (4.20) 
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as /i-»oo. It can be shown (using the argument of Section 2.1 for example) 
that 



E{W) = 



E{S k ) 



thus, E(S k -N k E(W)) - and (4.20) becomes 



(4.21) 



S(n)-N(n)E(W) 



-N k E(JV)) 



(4.22) 



Expanding the variance term in (4.22), we have 

V(S k - N k E{ W)) = V{S k ) + E 2 ( W) V(N k )—2E( W)Cov(S k ,N k ). 
If we define 



n 1 Jt-i \ N (n) / " 1 Jfc-i 



-2 



^-^tS («))(** -"(*))> 



(4.23) 



then K(/z) is a consistent estimator (see any text in statistics) of V(S k — 
N k E(W)), and (4.22) remains true when V{S k - N k E(W)) is replaced by 
V(ri). Thus we can say that, for sufficiently large «, 



P{ S(n)-N(n)E(W) <t ) m 
Vv(n)/n J 



(4.24) 



It follows from (4.24) that 



P \m _ -j-Jm <E{W) < + ^ A /^r i 

[N(n) N(n) V « . N(n) N(n) V w j 

=2&(t)-l. (4.25) 

Therefore, if t a is the value of t for which the right-hand side of (4.25) 
equals a, that is 

2Q>{t a )-\ = a, 
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then 

/ S(n) , . S(n) , g/ .\ 
\ N(n) N(n) J 

is a 100a% confidence interval for E(W), where 



S(«,a) = 




(4.26) 



Thus, in an «-cycle simulation of a regenerative queue, the mean waiting 
time E(W) is estimated as the ratio S(n)/N(n); and for any specified a, 
the interval defined by 

4^ ± «(«,«) 

N(n) 

will, with probability a, contain the true value of the mean waiting time (if 
the number n of cycles is sufficiently large). 

In summary, it is clear that the regenerative method, when applicable, 
gives much insight and provides answers to several of the questions posed 
earlier in this section. (In particular, note that with the regenerative 
method the question of how many arrivals must be generated before 
statistical equilibrium is attained is moot.) Unfortunately, as our examples 
indicate, the method is often difficult to apply to those models that are not 
simple enough for mathematical analysis. Some interesting progress in this 
area has been made by Lavenberg and Sauer [1977], who use the regenera- 
tive method to estimate confidence intervals for response variables for 
relatively complicated queueing models, and give sequential stopping rules 
that terminate a simulation run when the relative width of an estimated 
confidence interval reaches a prespecified value. This is an area of active 
and promising research. The interested reader should begin by consulting 
the aforementioned paper, the following papers, and some of the other 
papers referenced therein: Carson and Law [1980], Crane and Iglehart 
[1974a, 1974b, 1975], Crane and Lemoine [1977], Gunther and Wolff 
[1980], Heidelberger [1977, 1978a, 1978b], Heidelberger and Iglehart [1979], 
Heidelberger and Lewis [1980], Heidelberger and Welch [1980], Iglehart 
[1978], Iglehart and Shedler [1978], Lavenberg [1978], Lavenberg and Slutz 
[1975], and Schruben [1980]. The regenerative method is not the only 
method for predetermining run length and/or calculating confidence inter- 
vals from simulation output. For another approach (based on the method 
of batch means), and also a list of references that discuss other methods, 
see Fishman [1978b] and Law and Carson [1979]. 
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Exercises 

3. Derive Equation (4.18). 

4. Consider simulation of the single-server Erlang loss model with constant 
service times. 

a. Show that if the offered load is a — 1 erlang, then for all values of 5, 
asymptotically, 

I* {77- 8 < 77(h) < 77+ 8} — P {77- 8 <P(n) < 71+ 5 }, 

where 17 is the loss probability, and 77 (n) and P(n) are the estimators of 77 
defined by Equations (4.4) and (4.17). 

b. Tabulate the values of P{ 77 -0.05 < 77(1 00) < 17+0.05} and P{ 77- 0.05 < 
^(lOOXIT+O.OS) for 77= 0.05, 0.10, 0.15, 0.20,..., 0.95. On the same set of 
axes plot the graph of each of these probabilities versus 77. 



6.5. Examples 

In this section we discuss two examples of simulation studies of queueing 
models. These particular examples were chosen because, despite their 
structural simplicity, they illustrate some important points in simulation 
methodology, and illuminate the characteristic strengths and weaknesses of 
simulation analysis. 

Waiting Times in a Queue with Quasirandom Input and Constant 
Service Times 

As the reader is by now well aware, queueing theory contains many results 
that are, at first glance, puzzling and even absolutely counterintuitive. It is, 
of course, a tautology that no truth is counterintuitive to one who under- 
stands it. Nevertheless, it is probably safe to assume that even the reader 
who has advanced this far in this book retains some naivete about 
queueing theory, and therefore it is worthwhile to discuss a case in which 
simulation analysis uncovered a very puzzling anomaly. 

The model under consideration is characterized by n sources generating 
quasirandom input with rate y per idle source, s servers with constant 
service times, blocked customers delayed, and service in order of arrival. 
This model has applications in teletraffic engineering. Of particular interest 
in these applications is knowledge of the equiKbrium waiting-time distribu- 
tion as a function of the server occupancy. (The server occupancy, of 
course, is a strictly increasing function of y.) C. J. Durnan and J. H. Weber 
[1961, unpublished] used simulation to study the waiting times in this 
model. (Their results are discussed also in Durnan [1967].) We will briefly 
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describe the logic by which this model can be simulated, the validation of 
the simulation program, and the results obtained by Durnan and Weber 
and their consequences. 

We begin with the programming logic. Let us suppose that an event, 
either an arrival or a service completion, has just occurred in the simula- 
tion and all associated bookkeeping has been done. Let n—j (0<j<n) be 
the number of idle sources, and let y be the time remaining until the next 
service completion is scheduled to occur. Now the simulation determines 
the type of event and the elapsed time to the next event. To do this, the 
simulation generates a value x from the exponential distribution with mean 
[(n—j)y]~ 1 (which can be implemented according to (2.8), with r = [(n — 
J)y]~ ')• If * <y, the event is an arrival: The simulation records are updated 
to reflect this fact; the simulation clock is advanced by the amount x; and 
a new value is drawn from the exponential distribution, now with the mean 
[(n—j — l)y]~'. If x>y, the event is a service completion: The simulation 
records are updated; the clock is advanced by the amount y; and a new 
value x is drawn from the exponential distribution, now with mean 
[(n—j+l)y]~ l . This procedure is repeated (with obvious modifications 
when j = or j = n) until the simulation is deemed to have run long 
enough. 

Durnan and Weber wrote such a simulation, with similar or equivalent 
logic, to obtain information on waiting times when service is in order of 
arrival. In order to validate their simulation, they ran it for some special 
cases for which mathematical results were available, such as the case of 
s — l server (Harrison [1959]). Also, the equilibrium distribution of the 
number of customers in service when n — s is known [and is given by the 
binomial distribution that results from setting s = n in (7,5) of Chapter 3]. 
Furthermore, it would be easy (and wise) to write the simulation so that 
the service times can be assigned values drawn from an exponential 
distribution instead of being assigned a constant value. Then the simula- 
tion results can be checked against the formulas derived in Section 3.8. In 
a similar fashion, Durnan and Weber checked their simulation results 
against mathematical formulas for various models similar to the one they 
were studying. Their results seemed to agree with theory in every case they 
tried. They concluded that their model was valid (that is, there were no 
logical bugs and the statistical procedures were satisfactory) and proceeded 
to make simulation runs to study the original model. 

In particular, they used their simulation output to plot curves of the 
proportion (probability) of requests whose delay exceeded t service times 
versus the observed server occupancy. It is intuitively obvious to even the 
most casual observer that this probability must be a strictly increasing 
function of the server occupancy. Nevertheless, their results indicated that 
there were values of t,n, and s for which this probability decreases when 
the server occupancy increases within a certain range. 
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Figure 6.1. 

Figure 6.1 reproduces some of the curves obtained by Durnan and 
Weber. The curve corresponding to t = 0.7 for the model with n = 3 sources 
and s~1 servers gives a hint of nonmonotonicity, while the curve for 
t = 0.5, n = 4, and s = 3 shows the anomaly clearly. 

The Durnan-Weber dip, as it came to be called by its friends, caused 
much consternation and gnashing of teeth. It was assumed, of course, that 
the dip was a manifestation of an error. The question was, what kind of 
error? The first reaction was to increase the run length to minimize the 
effect of sampling error. But larger sample sizes only strengthened the 
likelihood that the dip was really there, and not a spurious effect of 
statistical noise. Then the search turned to the random-number generator 
and to the programming logic. But the random-number generator seemed 
to perform acceptably in every other case, and the model was so simple 
that it seemed unlikely that a logical bug could escape the careful scrutiny 
to which the program was subjected. It is a credit to the skill and integrity 
of the investigators that they did not simply overlook the dip, or fudge it 
away, or present the curves only in the ranges where the dip was not 
found, or ascribe the dip to "experimental error" and ignore it. Instead, 
they withheld publication until they could understand what was going on. 
Eventually, mathematical verification of the dip was forthcoming, as 
reported in Durnan [1967]: "Although no intuitive justification has been 
advanced to support this phenomenon, its existence was subsequently 
verified by P. J. Burke who used numerical methods to obtain the delay 
distributions for the special case of three sources on two servers." 

Although Burke's numerical work verified the existence of the dip, it did 
not explain it, and the problem remained in the air. A description of the 
numerical calculations for the case when n = 3 and s = 2 was later given in 
Burke [1979], as well as an intuitive justification for the existence of the dip 
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and the following conjecture: 



lim F(y,t) = 

y— >oo 



1 when t < 0.5, 
when f>0.5, 



where F{y, t) is the equilibrium probability that an arbitrary request waits 
more than / service times for service to commence when the request rate 
per idle source is y. Half in [1979] proves that "Burke's conjecture is right 
(except perhaps for r = 0.5), even under weaker assumptions about the 
arrival process." Halfin's interesting paper is perhaps best summarized by 
its abstract: 

The behavior of queueing systems with two servers and one waiting room is 
investigated. It is shown that if the service time is constant, then the difference 
between the times to service completion of the two servers (phase difference) 
tends to a constant, for increasing input intensities. This phenomenon holds 
for a wide class of arrival processes, but not when the service time has even a 
small variability. These results imply that the delay is not stochastically 
monotone in the input intensity. In general, we conjecture that the behavior of 
the phase differences, and delays, depends on whether the size of the waiting 
room is even or odd. 

In summary, this example illustrates a case where an entirely unsus- 
pected and mathematically interesting result was uncovered through a 
simulation study of a simple model, conducted as part of an engineering 
study of a real system. In this case simulation and mathematical analysis 
worked together to yield results of both practical and theoretical interest. 
One can only speculate on the number of counterintuitive facts that have 
remained buried in simulation studies of more complicated models, or in 
simulation studies performed by less competent investigators. 

Response Times in a Processor-Sharing Operating System 

The methodology of simulation is usually used to study (models of) real 
systems that are deemed too complicated for mathematical analysis. That 
is, a choice is made between what are assumed to be mutually exclusive 
methodologies, with simulation being the methodology of choice to study 
complicated, real systems, and mathematics being the methodology of 
choice to study simple, abstract models. Of course, a certain amount of 
mathematics is necessarily embodied in a simulation analysis, but it is 
unusual for simulation methodology to be used for the stated purpose of 
answering "mathematical" questions. In our discussion of the queue with 
constant service times and quasirandom input, we saw that a simulation 
study stimulated mathematical analysis of a particular model by uncover- 
ing a counterintuitive fact, but this was inadvertent. We now turn to an 
example in which simulation was used to investigate an essentially 
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mathematical question, namely, how the response times (sojourn times) 
reflect the variability of the processing times (service times) in a model of a 
computer operating system. An interesting point about this use of simula- 
tion is that the model in question, which provides an upper bound on the 
performance of a real computer operating system, is obtained as a 
mathematical limit of a sequence of models, each of which can, in 
principle, be simulated; but the limiting model itself is not physically 
realizable, and therefore cannot be simulated directly. We begin with a 
description of the round-robin scheduling algorithm, which is one of the 
best known and most widely used operating-system scheduling algorithms 
for time-shared computer systems. According to this algorithm, the 
processor allocates to each job a fixed amount of time, called a quantum or 
time slice. If a job's service time (the total time required from the 
processor) is completed in less than the quantum, it leaves; otherwise, it 
feeds back to the end of the queue of waiting jobs, waits its turn to receive 
another quantum of service, and continues in this fashion until its total 
service time has been obtained from the processor. 

This model, which has the pleasant property that it tends to yield a 
shorter response time (the total time required for the job to receive its full 
service time from the processor) for a job with a shorter service time, was 
first analyzed by Kleinrock [1964]. An important advance was made when 
Kleinrock [1967] studied the same model in the limiting case where the 
quantum length goes to zero. (He called this model processor sharing, 
because it is equivalent to the assumption that all k jobs in the mix are 
being processed simultaneously at l/k times the rate at which a single job 
would be processed if it were the only job in the mix.) This greatly 
simplified the mathematics, and led to the remarkable conclusion that in 
the processor-sharing model, the mean response time for all jobs whose 
total required service time is / is proportional to t. More precisely, 
Kleinrock's 1967 paper showed that under the assumptions that (1) the 
jobs arrive in the mix according to a Poisson process, (2) the service times 
are exponentially distributed, and (3) there is one processor, then the mean 
value R x (i) of the equilibrium response time for a job whose service time is 
t is given by 

Rl {t)=-^-t, (5.1) 

where a is the product of the arrival rate and the mean service time. The 
range of applicability of (5.1) was considerably widened when Sakata, 
Noguchi, and Oizumi [1971] showed that assumption (2) is superfluous; 
that is, (5.1) is valid for any distribution of service times whatever. (For 
further discussion see Kleinrock [1976], and also Coffman and Denning 
[1973].) 
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The above considerations raise the question about further generaliza- 
tions of (5.1). In particular, what is the mean response time for a job whose 
service time is t in a processor-sharing model with an arbitrary distribution 
of service times and an arbitrary number s of processors? It is reasonable 
to conjecture that this quantity — call it R s (f) — does not depend on the 
form of the service-time distribution function (because it has been proved 
so for s= 1, and is clearly true for s = oo; it would be surprising if such a 
dependence would appear only for the intermediate values 1 < s < oo). It 
is also reasonable to conjecture (and we shall explain why shortly) that 
R s (t) is given by the following formula: 

where C(s,d) is the Erlang delay formula, given by Equation (4.8) of 
Chapter 3. Thus, we conjecture that in an j-server queueing model char- 
acterized by Poisson input and processor-sharing queue discipline, the 
equilibrium mean response time for a job that requires a total of / units of 
service time is given by (5.2), where a is the offered load. 

To see why the right-hand side of (5.2) is a reasonable guess for R s (t), 
observe that the processor-sharing queue discipline is what we have previ- 
ously termed "nonbiased," and therefore, when the service times are 
exponential, the mean response time R s , say, is the same as the sum of the 
mean service time and the mean waiting time in the corresponding Erlang 
delay system [see Equation (4.27) of Chapter 3]: 




Since 

rR s (t)dH{t) (5.4) 
J o 

for any service-time distribution function H(t), it follows [from substitution 
of (5.2) into (5.4)] that our hypothesis (5.2) is consistent with (5.3), at least 
for the case of exponential service times. Equation (5.2) retains the prop- 
erty of linearity in t exhibited by (5.1), and reduces to (5.1) when s=l. 
These observations all support the hypothesis that (5.2) is, in fact, the 
generalization of (5.1) for the case of arbitrary s and arbitrary H(t). 

In order to further study this hypothesis, a simulation program that 
would measure the response times and compare them with the values 
predicted by (5.2) was written as a project in a course in simulation (and 
reported at a conference in New York; an abstract is given in Cooper and 
Marmon [1978]). The service times were modeled by the method of phases; 
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that is, if X is the random variable that represents the total required service 
time of an arbitrary job, then X has distribution function P{X<x} = 
H n (x) given by the Erlangian distribution function of order n: 

H n {x)=\- "± l ^e~^ (5.5) 

where n is the number of phases, each with mean and variance fi~ 2 . 
Hence, if we take ju = n we have 

E(X) = l (5.6) 

and 

V(X)-±; (5.7) 

that is, as the parameter n is assigned the values 1,2,..., the mean service 
time remains constant at one, while the variance of service times runs from 
one to an arbitrarily small number. This corresponds to a spectrum of 
service-time distributions, ranging from high variability (n= 1, exponential 
service times) to no variability («= oo, constant service times), all with a 
mean value of unity. This model does not include all possible service-time 
distributions, of course, but it does provide a wide range of variability of 
service times. One might reasonably expect that if the mean response time 
does indeed depend on the form of the service-time distribution function, 
then the simulated response times will show a dependence on the value of 
the parameter n used in that particular run. Besides providing an easy way 
to describe a family of service-time distributions with different variances, 
the use of the method of phases to describe the service times has another 
important property in this application: It permits the adoption of a 
particularly simple simulation algorithm, based on the properties of the 
birth-and-death process. We emphasize that this algorithm gives an exact 
representation of the processor-sharing model, not an approximation for a 
"negligible" value of the time slice. This is consistent with the objective of 
using simulation to test a mathematical formula that is conjectured to 
describe exactly a mathematical model that is itself a physically unrealiz- 
able idealization of the round-robin operating system. 

We now describe the simulation algorithm. Suppose there are k jobs 
present in the system. Then each job is receiving service at rate /a if k < s 
and rate sfx/k if k>s, where /a" 1 is the mean duration of an (exponentially 
distributed) phase of service. If we define 



f kfx (k = Q,l,...,s), 
\ sfi (k = s+l,s + 2,...), 



(5.8) 



Examples 



305 



then the probability p(k) that the next event is a phase completion is 

and the probability q(k) that the next event is an arrival of a new job in the 
mix is 

q(k)=\-p(k) (*-(U,...). (5.10) 

where A is the rate of the Poisson arrival process. If the event is a phase 
completion, then the probability that it corresponds to any given job is 
\/k. Random numbers are generated and decisions as to the kind of event 
generated are made according to these probabilities. Records are kept as to 
the number of jobs present and the number of phases of service completed 
by each job in the system. The time between events is calculated by 
drawing a sample value x = x(k) from an exponential distribution with 
mean [\ + r(k)\~ \ where k is the number of jobs in the mix during the time 
between events. Thus, whenever an arrival or phase completion occurs, the 
cumulative response time for each job present is increased by the amount 
x(k). If k<s, each job's cumulative service time is also increased by the 
same amount x; but if k>s, each job's cumulative service time is increased 
by an amount x x = (s/k)x. 

Thus, whenever a job leaves the system, it has two times associated with 
it — the actual response time experienced by the job and the actual amount 
of service time t that the job received. From these data the emulation 
calculates, for a range of values of t,$ t and a, the average value R s (t) of the 
response time for a job that received (required) t units of service. 

Finally, the values of R s (t) produced by the simulation are compared 
with the hypothesized corresponding theoretical values of R s (t) given by 
(5.2). Some examples of these comparisons are given in Figures 6.2, 6.3, 
and 6.4, where R s (t) is plotted against R^O) for n = 1, 2, and 10 and s = 1, 2, 
and 3. If the hypothesis (5.2) is correct, each of these graphs should be the 



Figure 6.2. 
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Figure 6.4. 



straight line with unit slope, with any deviation attributable solely to 
statistical sampling error and roundoff error. Figure 6.2 corresponds to the 
case 5 = 1, for which case, according to Sakata, Noguchi, and Oizumi 
[1971], Equation (5.2) is correct. Inspection of Figures 6.3 and 6.4 shows 
the same quality of agreement between simulation and theory that Figure 
6.2 shows. This indicates that the deviations from the straight line with unit 
slope in Figures 6.3 and 6.4 are manifestations of the same source of error 
that appears in Figure 6.2, namely, statistical and roundoff error. In 
summary, the simulation results support the conjecture that (5.2) is correct. 

In conclusion, this simulation study not only provides the usual kind of 
engineering information about the performance of a system as a function 
of the relevant parameters, but, more interestingly, illustrates the use of 
simulation as a complement to mathematical analysis in the theoretical 
study of mathematical models. 

As an interesting postscript, we observe that even stronger evidence for 
the hypothesis (5.2) is available: In a relatively obscure paper, presented at 
a conference in Hawaii, Sakata, Noguchi, and Oizumi [1969] proved 
Equation (5.2). (Surprisingly, these authors did not reference their 1969 
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paper in their 1971 paper, which is on the same subject and is much more 
accessible.) In addition, J. W. Cohen presented a paper at a 1977 con- 
ference in the Netherlands that, independently, also arrived at (5.2) (an 
abstract is given in Cohen [1978]). Finally, R. Schassberger has pointed out 
to this author that formula (5.2) can be obtained after much calculation as 
a special case of formula (12) of his unpublished paper, Schassberger 
[1978c]. Needless to say, each author's work was done with no knowledge 
of the work of any of the others. 
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To the casual observer, the very idea that so simple-minded a concept as 
that of a queue should form the basis of a mathematical theory to which 
otherwise rational people devote theiT professional lives may seem 
ludicrous. Perhaps. But, as the reader will have recognized by now, 
queueing theory is an unending subject to which this book provides only 
an introduction. It is the purpose of this chapter to provide a guide to the 
textbooks, reference works, tables, and research journals on queueing 
theory. 

We begin with a list of textbooks on queueing theory. The list includes, 
to the best of the author's knowledge, every English-language hardcover 
textbook devoted primarily to queues, as well as some other books that fall 
outside this category. Each book is referenced by author, date, and title; 
additional publishing information is given in the References. 

Bear [1976], Principles of Telecommunication-Traffic Engineering. The appli- 
cation of probability (and queueing) theory in the engineering and 
administration of telephone systems. 

Beckmann [1968], Introduction to Elementary Queuing Theory and Telephone 
Traffic. Short outline of basic formulas and applications to telephone 
traffic engineering. 

Conolly [1975], Lecture Notes on Queueing Systems. Lecture notes from a 
graduate-level course. 

Cox and Smith [1961], Queues. Short and elegant overview of the theory, 
with an eye toward application. Especially valuable is the carefully 
selected list of references in an appendix. 

Fry [1928, 1965], Probability and Its Engineering Uses. Probably the first 
reference and/or textbook to treat queueing theory (mainly in the 
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context of telephone traffic theory). Remarkable in 1928, but the 

second edition, 1965, adds little. 
Giffin [1978], Queueing: Basic Theory and Applications. A "primer on 

queues ... for practitioners rather than theoretical mathematicians." 
Gross and Harris [1974], Fundamentals of Queueing Theory. Designed for a 

graduate-level course in operations research. 
Hillier and Stidham [1981, in preparation], Queueing Theory: Describing 

and Optimizing Queueing Systems. 
Kleinrock [1975], Queueing Systems, Volume I: Theory. Designed for a 

graduate-level course. Prepares the reader for Volume II: Computer 

Applications. 

Kosten [1973], Stochastic Theory of Service Systems. Treats some of the 
standard models and variants, some of which have a telecommunica- 
tions flavor. 

Lee [1966], Applied Queueing Theory. Short exposition of theory followed 

by illustrative case studies. 
Mina [1974], Introduction to Teletraffic Engineering. A manual of telephone 

traffic engineering practice, with examples, graphs, and tables. 
Morse [1958], Queues , Inventories and Maintenance. Probably the first 

widely used and influential book on the subject. Clear explanation of 

mathematical concepts and applications. Still useful as a textbook. 
Murdoch [1978], Queueing Theory: Worked Examples and Problems. "It is 

designed to meet the needs not only of management science training 

programmes, but also of mangement (sic) teaching programmes." 
Page [1972], Queueing Theory in OR. Outlines the derivation of formulas 

for some classical queueing models, gives numerical examples and 

problems. 

Panico [1969], Queuing Theory: A Study of Waiting Lines for Business, 
Economics, and Science. Totally confused and confusing. 

Ruiz-Pala, Avila-Beloso, and Hines [1967], Waiting-Line Models. 

White, Schmidt, and Bennett [1975], Analysis of Queueing Systems. Well- 
written textbook for applications-oriented students with an engineer- 
ing background. 

The following are important reference works and monographs. 

Benes [1963], General Stochastic Processes in the Theory of Queues. An 

attempt to present the theory of single-server queues simultaneously to 

mathematicians and engineers. 
Benes [1965], Mathematical Theory of Connecting Networks and Telephone 

Traffic. Reprints of papers originally published in The Bell System 

Technical Journal. 

Borovkov [1976], Stochastic Processes in Queueing Theory, "... an attempt 
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to set forth the contemporary state (as it appears to us) of the 
mathematical theory of queues from the most unified and general 
standpoint possible." 

Brockmeyer, Halstrem, and Jensen [1948], The Life and Works of A. K. 
Erlang. An especially interesting and important historical work, which 
should be on the shelf of every afficianado. 

Cohen [1969], The Single Server Queue. A massive, rigorous mathematical 
treatment, relying heavily on complex-variable theory, of the classical 
single-server models and some of their variants. Contains substantial 
preliminary material on stochastic processes and an extensive bibliog- 
raphy. 

Cohen [1976], On Regenerative Processes in Queueing Theory. Views the 
classical models in the light of recent results in the theory of regenera- 
tive processes. 

Conway, Maxwell, and Miller [1967], Theory of Scheduling. About one- third 
of this book concerns queueing theory. 

Franken, Konig, Arndt, and Schmidt [1981], Queues and Point Processes. 

Gnedenko and Kovalenko [1968], Introduction to Queueing Theory. Empha- 
sis on theory. References many Russian-language papers. 

Jaiswal [1968], Priority Queues, "...addressed primarily to research stu- 
dents ..." 

Kelly [1979], Reversibility and Stochastic Networks. Discusses the concept 
of reversibility, including its application to the analysis of networks of 
queues. 

Khintchine [1969], Mathematical Methods in the Theory of Queueing. A 
short, rigorous, mathematical study of some classical models and the 
properties of the underlying stochastic processes. 

LeGall [1962], Les systemes avec ou sans attente et les processes stochas- 
tiques, Tome 1. 

Neuts [1981], Matrix-Geometric Solutions in Stochastic Models — An Al- 
gorithmic Approach. 

Pollaczek [1957], Problemes stochastiques poses par le phenomene de forma- 
tion d "une queue d 'attente a un guichet et par des phenomenes apparentes. 

Pollaczek [1961], Theorie analytique des problemes stochastiques relatifs a un 
group de Ugnes telephoniques avec dispositif d 'attente. 

Prabhu [1965], Queues and Inventories. "The purpose of this book is to give 
a unified treatment of [their basic stochastic] properties." 

Riordan [1962], Stochastic Service Systems. Queueing theory with an em- 
phasis on topics relating to teletraffic theory. 

Saaty [1961], Elements of Queueing Theory. A text and summary of scattered 
papers and monographs, with an extensive bibliography. 

Schassberger [1973], Warteschlangen. 
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Syski [1960], Introduction to Congestion Theory in Telephone Systems. Encyl- 
opedic. Treats standard queueing models and background probability 
theory; contains summaries of a large number of papers relating to 
teletraffic theory, extensive bibliography. 

Takacs [1962a], Introduction to the Theory of Queues. A rigorous mathe- 
matical treatment of some classical models. Contains exercises and 
their solutions. 

There are many textbooks on methods of operations research and manage- 
ment science, most of which devote at least a chapter to queueing theory 
and its applications. These chapters might provide an overview, but in 
most cases this is poorly done. Two exceptions are 

Hillier and Lieberman [1980], Introduction to Operations Research , 3rd ed. 
Wagner [1975], Principles of Operations Research, 2nd ed. 

Some books concerned with applications of queueing theory in computer 
science are 

Allen [1978], Probability, Statistics, and Queueing Theory, with Computer 
Science Applications. "... a junior- senior level textbook on applied 
probability and statistics with computer science applications." 

Coffman and Denning [1973], Operating Systems Theory. Chapter 4 treats 
queueing theory. 

Courtois [1977], Decomposability: Queueing and Computer System Applica- 
tions. A monograph on queueing networks and computer system 
performance evaluation. 

Ferrari [1978], Computer Systems Performance Evaluation. Chapter 4 dis- 
cusses queueing models. 

Kleinrock [1964b], Communication Nets: Stochastic Message Flow and De- 
lay. Probably the first application of queueing theory to the analysis of 
store-and-forward communication networks. 

Kleinrock [1976], Queueing Systems, Volume II: Computer Applications. 
Queueing theory applied to computer performance evaluation and to 
the design and analysis of computer-communication networks. 

Kobayashi [1978], Modeling and Analysis: An Introduction to System Perfor- 
mance Evaluation Methodology. Contains material on queueing theory 
and its application to computer performance evaluation. 

Sauer and Chandy [1981], Computer Systems Performance Modeling. 

Queueing Network Models of Computer System Performance, a special 
issue of ACM Computing Surveys (Vol. 10, No. 3, September 1978) 
devoted entirely to performance evaluation of computer systems, with 
focus on queueing network models. 
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Queueing theory with an eye toward application in transportation en- 
gineering is presented in 

Newell [1971], Applications of Queueing Theory. 

Newell [1973], Approximate Stochastic Behavior of n- Server Service Systems 
with Large n. 

Newell (1979], Approximate Behavior of Tandem Queues. 

Two books that are not nominally about queueing theory, but contain 
substantial amounts of related material are 

Takacs [1960a], Stochastic Processes: Problems and Solutions. A collection 
of challenging problems imbedded in a small amount of text, with 
complete and detailed solutions. 

Takacs [1967], Combinatorial Methods in the Theory of Stochastic Processes. 
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The number of papers on queueing theory that have been published since 
the seminal work of Erlang is enormous. One would think that there would 
be little left to say, but it seems that new papers are appearing at an 
ever-increasing rate. Following is a list of journals that regularly carry 
papers on queueing theory or its application: 

Advances in Applied Probability. Contains review and expository papers, 
as well as mathematical and scientific papers of interest to probabilists. 
Each issue invariably contains at least one paper on queueing theory. 
The Bell System Technical Journal. Occasionally contains papers on 
queueing theory, usually relating to teletraffic theory or engineering. 
IBM Journal of Research and Development. Occasionally contains 
papers on queueing theory and its applications in computer science. 
IEEE Transactions on Communications. Regularly publishes papers on 
queueing theory and its application to communication system analysis. 
IEEE Transactions on Software Engineering. Regularly publishes papers 
on queueing theory and its application to computer system analysis. 
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Applied Probability, it contains "research papers and notes on applica- 
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theory; they are not case studies. 

Journal of the Association for Computing Machinery. Often contains 
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Journal of the Operations Research Society of Japan. Often contains 
papers on queueing theory. 

Management Science. Emphasizes the methodological and practical 
aspects of the development and implementation of management sci- 
ence models. Averages about one paper per issue related to queueing 
theory. 

Mathematics of Operations Research. Averages slightly less than one 
paper per issue on the mathematics of queueing theory. 
Naval Research Logistics Quarterly. Occasionally contains papers on 
queueing theory. 

Operations Research. Emphasizes practice, operational science and 
methodologies. Averages about one paper per issue related to queueing 
theory. 

Opsearch. The journal of The Operational Research Society of India. 
Often contains papers on queueing theory. 
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Proceedings of the International Teletrqffic Congress. Held every third 
year beginning in 1955. Many papers on the application of probability 
and queueing theory to teletraffic theory and engineering. (Available 
from several libraries throughout the world, including the Engineering 
Societies Library, 345 East 47 Street, New York, New York 10017, 
U.S.A.) 

Studies in Congestion Theory. A continuing series of tables and reports 
published by the Institute of Switching and Data Technics at the 
University of Stuttgart. 



Appendix 

Engineering Curves for Systems 
with Poisson Input 



In the following figures, the number of servers is denoted by s, and the 
magnitude in erlangs of the offered load is denoted by a (a=\/n, where A 
is the customer arrival rate, and ju~ 1 is the mean service time). 
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Figure A.5. Conditional probability P{ W>t\ W>0} = € -V-»*i« that a blocked 
customer waits beyond t for service to commence in a delay system (blocked 
customers wait until served) with exponential service times and service in order of 
arrival, plotted against spJ, for different values of the generalized occupancy to. For 
Erlang delay systems (Poisson input, exponential service times), io^p^a/s. To 
calculate the unconditional probability P{W>t) that a customer waits beyond / 
for service to commence in an Erlang delay system with service in order of arrival, 
use the formula P{W>t} = P{ W>t\W>0}C(s t a), where C(s,a) is determined 
from Figure A.3 or A.4. 
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